Abstract. Let $(X_{jk})_{j,k\geq 1}$ be i.i.d. complex random variables such that $|X_{jk}|$ is in the domain
of attraction of an $\alpha$-stable law, with $0 < \alpha < 2$. Our main result is a heavy tailed counterpart
of Girko's circular law. Namely, under some additional smoothness assumptions on the law of
$X_{jk}$, we prove that there exist a deterministic sequence $a_n \sim n^{1/\alpha}$ and a probability measure
$\mu_\alpha$ on $\mathbb{C}$ depending only on $\alpha$ such that with probability one, the empirical distribution of
the eigenvalues of the rescaled matrix $(a_n^{-1} X_{jk})_{1\leq j,k\leq n}$ converges weakly to $\mu_\alpha$ as $n \to \infty$.
Our approach combines Aldous & Steele's objective method with Girko's Hermitization using
logarithmic potentials. The underlying limiting object is defined on a bipartized version of
Aldous' Poisson Weighted Infinite Tree. Recursive relations on the tree provide some properties
of $\mu_\alpha$. In contrast with the Hermitian case, we find that $\mu_\alpha$ is not heavy tailed.

1. Introduction

The eigenvalues of an $n \times n$ complex matrix $M$ are the roots in $\mathbb{C}$ of its characteristic polynomial.
We label them $\lambda_1(M), \ldots, \lambda_n(M)$ so that $|\lambda_1(M)| \geq \cdots \geq |\lambda_n(M)| \geq 0$. We also denote by
$s_1(M) \geq \cdots \geq s_n(M)$ the singular values of $M$, defined for every $1 \leq k \leq n$ by
$s_k(M) := \lambda_k(\sqrt{MM^*})$ where $M^* = \bar{M}^\top$ is the conjugate transpose of $M$. We define the empirical spectral
measure and the empirical singular values measure as

\[ \mu_M = \frac{1}{n}\sum_{k=1}^n \delta_{\lambda_k(M)} \quad\text{and}\quad \nu_M = \frac{1}{n}\sum_{k=1}^n \delta_{s_k(M)}. \]

Let $(X_{ij})_{i,j\geq 1}$ be i.i.d. complex random variables with cumulative distribution function $F$. Consider
the matrix $X = (X_{ij})_{1\leq i,j\leq n}$. Following Dozier and Silverstein (201 H], if $F$ has finite positive
variance $\sigma^2$, then for every $z \in \mathbb{C}$, there exists a probability measure $Q_{\sigma,z}$ on $[0,\infty)$ depending
only on $\sigma$ and $z$, with explicit Cauchy-Stieltjes transform, such that a.s. (almost surely)

\[ \nu_{n^{-1/2}X - zI} \underset{n\to\infty}{\rightsquigarrow} Q_{\sigma,z} \tag{1.1} \]

where $\rightsquigarrow$ denotes the weak convergence of probability measures. The proof of (1.1) is based on
a classical approach for Hermitian random matrices with bounded second moment: truncation,
centralization, recursion on the resolvent, and cubic equation for the limiting Cauchy-Stieltjes
transform. In the special case $z = 0$ the statement (1.1) reduces to the quarter-circular law
theorem (square version of the Marchenko-Pastur theorem, see [371 ESI El]) and the probability
measure $Q_{\sigma,0}$ is the quarter-circular law with Lebesgue density

\[ x \mapsto \frac{1}{\pi\sigma^2}\sqrt{4\sigma^2 - x^2}\,\mathbf{1}_{[0,2\sigma]}(x). \tag{1.2} \]



Girko's famous circular law theorem [25] states under the same assumptions that a.s.

\[ \mu_{n^{-1/2}X} \underset{n\to\infty}{\rightsquigarrow} \mathcal{U}_\sigma \tag{1.3} \]

where $\mathcal{U}_\sigma$ is the uniform law on the disc $\{z \in \mathbb{C};\ |z| \leq \sigma\}$. This statement was established through
a long sequence of partial results [311[ll[ini[Ml[lIl[251ll[2ZlEllinil2HlllHlED], the general case
(1.3) being finally obtained by Tao and Vu [50] by using Girko's Hermitization with logarithmic
potentials and uniform integrability, the convergence (1.1), and polynomial bounds on the extremal
singular values.

1.1. Main results. The aim of this paper is to investigate what happens when F does not have 
a finite second moment. We shall consider the following hypothesis: 

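(H1) there exists a slowly varying function $L$ (i.e. $\lim_{t\to\infty} L(xt)/L(t) = 1$ for any $x > 0$) and a
real number $\alpha \in (0,2)$ such that for every $t \geq 1$

\[ \mathbb{P}(|X_{11}| \geq t) = \int_{\{z\in\mathbb{C};\,|z|\geq t\}} dF(z) = L(t)\,t^{-\alpha}, \]

and there exists a probability measure $\theta$ on the unit circle $S^1 := \{z \in \mathbb{C};\ |z| = 1\}$ of the
complex plane such that for every Borel set $D \subset S^1$,

\[ \lim_{t\to\infty} \mathbb{P}\left( \frac{X_{11}}{|X_{11}|} \in D \,\Big|\, |X_{11}| \geq t \right) = \theta(D). \]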

Assumption (H1) states a complex version of the classical criterion for the domain of attraction
of a real $\alpha$-stable law, see e.g. Feller [23, Theorem IX.8.1a]. For instance, if $X_{11} = V_1 + iV_2$ with
$i = \sqrt{-1}$ and where $V_1$ and $V_2$ are independent real random variables both belonging to the domain
of attraction of an $\alpha$-stable law then (H1) holds. When (H1) holds, we define the sequence

\[ a_n := \inf\{a > 0 \ \text{s.t.}\ n\,\mathbb{P}(|X_{11}| \geq a) \leq 1\} \]

and (H1) implies that $\lim_{n\to\infty} n\,\mathbb{P}(|X_{11}| \geq a_n) = \lim_{n\to\infty} n\,a_n^{-\alpha} L(a_n) = 1$. It follows then classically
that $a_n = n^{1/\alpha}\ell(n)$ for every $n \geq 1$, for some slowly varying function $\ell$. The additional
possible assumptions on $F$ to be considered in the sequel are the following:

(H2) $\mathbb{P}(|X_{11}| \geq t) \sim_{t\to\infty} c\,t^{-\alpha}$ for some $c > 0$ (this implies $a_n \sim_{n\to\infty} c^{1/\alpha} n^{1/\alpha}$)

(H3) $X_{11}$ has a bounded probability Lebesgue density on $\mathbb{R}$ or on $\mathbb{C}$.

One can check that (H1-H2-H3) hold e.g. when $|X_{11}|$ and $X_{11}/|X_{11}|$ are independent with $|X_{11}| = |S|$
where $S$ is real symmetric $\alpha$-stable. Another basic example is given by $X_{11} = \varepsilon W^{-1/\alpha}$ with $\varepsilon$
and $W$ independent such that $\varepsilon$ takes values in $S^1$ and $W$ is uniform on $[0,1]$.
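
As a small numerical illustration of the definition of $a_n$ (a sketch, not part of the argument), consider the example $X_{11} = \varepsilon W^{-1/\alpha}$, for which $\mathbb{P}(|X_{11}| \geq t) = t^{-\alpha}$ for $t \geq 1$, so that one expects $a_n = n^{1/\alpha}$ for $n \geq 2$. The function names `tail` and `normalizing_constant` are hypothetical, and the bisection assumes a continuous non-increasing tail.

```python
def tail(t, alpha):
    """P(|X| >= t) for |X| = W**(-1/alpha) with W uniform on [0, 1]."""
    return 1.0 if t <= 1.0 else t ** (-alpha)


def normalizing_constant(n, alpha, iterations=200):
    """Approximate a_n = inf{a > 0 : n * P(|X| >= a) <= 1} by bisection."""
    lo, hi = 0.0, 1.0
    while n * tail(hi, alpha) > 1.0:      # enlarge the bracket until hi is admissible
        hi *= 2.0
    for _ in range(iterations):           # fixed number of halvings
        mid = 0.5 * (lo + hi)
        if n * tail(mid, alpha) <= 1.0:
            hi = mid                      # mid is admissible, move the upper end down
        else:
            lo = mid
    return hi


a_100 = normalizing_constant(100, 1.0)    # close to 100 ** (1 / 1.0)
a_8 = normalizing_constant(8, 1.5)        # close to 8 ** (1 / 1.5)
```

For a general slowly varying $L$ the same bisection applies once `tail` is replaced, but the closed form $n^{1/\alpha}$ is then only valid up to the slowly varying factor $\ell(n)$.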

For every $n \geq 1$, let us define the i.i.d. $n \times n$ complex matrix $A = A_n$ by

\[ A_{ij} := a_n^{-1} X_{ij} \tag{1.4} \]

for every $1 \leq i,j \leq n$. Our first result concerns the singular values of $A - zI$, $z \in \mathbb{C}$.

Theorem 1.1 (Singular values). If (H1) holds then for all $z \in \mathbb{C}$, there exists a probability measure
$\nu_{\alpha,z}$ on $[0,\infty)$ depending only on $\alpha$ and $z$ such that a.s.

\[ \nu_{A - zI} \underset{n\to\infty}{\rightsquigarrow} \nu_{\alpha,z}. \]

The case $z = 0$ was already obtained by Belinschi, Dembo and Guionnet [B]. Theorem 1.1 is
a heavy tailed version of the Dozier and Silverstein theorem (1.1). Our main results below give
a non-Hermitian version of Wigner's theorem for Lévy matrices [Ml [71 [6l [11], as well as a heavy
tailed version of Girko's circular law theorem (1.3).

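Theorem 1.2 (Eigenvalues). If (H1-H2-H3) hold then there exists a probability measure $\mu_\alpha$ on $\mathbb{C}$
depending only on $\alpha$ such that a.s.

\[ \mu_A \underset{n\to\infty}{\rightsquigarrow} \mu_\alpha. \]

Theorem 1.3 (Limiting law). The probability distribution $\mu_\alpha$ from theorem 1.2 is isotropic and
has a continuous density. Its density at $z = 0$ equals

\[ \frac{\Gamma(1 + 2/\alpha)^2\,\Gamma(1 + \alpha/2)^{2/\alpha}}{\pi\,\Gamma(1 - \alpha/2)^{2/\alpha}}. \]

Furthermore, up to a multiplicative constant, the density of $\mu_\alpha$ is equivalent to

\[ |z|^{2(\alpha-1)}\, e^{-\frac{\alpha}{2}|z|^{\alpha}} \quad\text{as } |z| \to \infty. \]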

Recall that for a normal matrix (i.e. which commutes with its adjoint), the absolute values of the
eigenvalues are equal to the singular values. Theorem 1.3 reveals a striking contrast between $\mu_\alpha$
and $\nu_{\alpha,0}$. The limiting law of the eigenvalues has a stretched exponential tail while the limiting
law $\nu_{\alpha,0}$ of the singular values is heavy tailed with power exponent $\alpha$, see e.g. [Q. This does not
contradict the identity $\prod_{k=1}^n |\lambda_k(A)| = \prod_{k=1}^n s_k(A)$, but it does indicate that $A$ is typically far
from being a normal matrix. A similar shrinking phenomenon appears already in the finite second
moment case (1.1)-(1.3): the law of the absolute value under the circular law $\mathcal{U}_\sigma$ has density

\[ r \mapsto 2\sigma^{-2}\, r\, \mathbf{1}_{[0,\sigma]}(r) \]

in contrast with the density (1.2) of the quarter-circular law $Q_{\sigma,0}$: even the supports differ by a
factor 2.

The proof of theorem 1.1 is given in section 2.8. It relies on an extension to non-Hermitian
matrices of the "objective method" approach developed in [11]. More precisely, we build an explicit
operator on Aldous' Poisson Weighted Infinite Tree (PWIT) and prove that it is the local limit
of the matrices $A_n$ in an appropriate sense. While Poisson statistics arise naturally as in all
heavy tailed phenomena, the fact that a tree structure appears in the limit is roughly explained
by the observation that the non vanishing entries of the rescaled matrix $A_n = a_n^{-1}X$ can be viewed
as the adjacency matrix of a sparse random graph which locally looks like a tree. In particular,
the convergence to PWIT is a weighted-graph version of familiar results on the local structure of
Erdős-Rényi random graphs.

The proof of theorem 1.2 is given in section 3. It relies on Girko's Hermitization method with
logarithmic potentials, on theorem 1.1, and on polynomial bounds on the extremal singular values
needed to establish a uniform integrability property. This extends the Hermitization method to
more general settings, by successfully mixing various arguments already developed in [TTl [T^ 150).
Following Tao and Vu, one of the key steps will be a lower bound on the distance of a row of the
matrix $A$ to a subspace of dimension at most $n - n^{1-\gamma}$, for some small $\gamma > 0$.

Girko's Hermitization method gives a characterization of $\mu_\alpha$ in terms of its logarithmic potential
(see appendix A). In our settings, however, this is not convenient to derive properties of the measure
$\mu_\alpha$, and our proof of theorem 1.3 is based on an analysis of a self-adjoint operator on the PWIT
and a recursive characterization of the spectral measure from the resolvent of this operator. This
method is explained in section 2 while the actual computations on the PWIT are performed in
section 4.

Let us conclude with some final remarks. Following [16], the derivation of a Markovian version
of theorems 1.1 and 1.2 is an interesting problem, see [TOl E] for the symmetric case and [T51 [T^
for the light tailed non-symmetric case. In another direction, it is also tempting to seek an
interpretation of $\nu_{\alpha,z}$ and $\mu_\alpha$ in terms of a sort of graphical free probability theory. Indeed, our
random operators are defined on trees and tree structures are closely related to freeness. Also,
with a proper notion of trace, it is possible to define the spectral measure of an operator, see e.g.
[T51 15T1. However these notions are usually defined on algebras of bounded operators and we
will not pursue this goal here. Note finally that theorems 1.1 and 1.2 remain available for additive
perturbations of finite rank, by following the methodology used in [171 EOl Hi].



1.2. Notation. Throughout the paper, the notation $n \gg 1$ means large enough $n$. For any
$c \in [0,\infty]$ and any couple $f, g$ of positive functions defined in a neighborhood of $c$, we say
that $f(t) \sim g(t)$ as $t$ goes to $c$, if $\lim_{t\to c} f(t)/g(t) = 1$. We denote by $\mathcal{D}'(\mathbb{C})$ the set of
Schwartz-Sobolev distributions endowed with its usual convergence with respect to all infinitely differentiable
functions with bounded support $C_0^\infty(\mathbb{C})$. We will consider the differential operators on $\mathbb{C} \simeq \mathbb{R}^2$,
for $z = x + iy$ (here $i = \sqrt{-1}$),

\[ \partial := \tfrac{1}{2}(\partial_x - i\partial_y) \quad\text{and}\quad \bar\partial := \tfrac{1}{2}(\partial_x + i\partial_y). \]

We have $\partial \bar z = \bar\partial z = 0$, $\partial z = \bar\partial \bar z = 1$ and the Laplace differential operator on $\mathbb{C}$ is given by

\[ \Delta = 4\partial\bar\partial = \partial_x^2 + \partial_y^2. \]

We use sometimes the shortened notation $A - z$ instead of $A - zI$.



2. Bipartized resolvent matrix

The aim of this section is to develop an efficient machinery to analyze the spectral measures
of a non-Hermitian matrix which avoids a direct use of the logarithmic potential and the singular
values. Our approach builds upon similar methods in the physics literature jUl [211 HH 133].

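2.1. Bipartization of a matrix. Let $n$ be an integer, and $A$ be an $n \times n$ complex matrix. We
introduce the symmetrized version of $\nu_{A-z}$,

\[ \check\nu_{A-z} = \frac{1}{2n}\sum_{k=1}^n \left(\delta_{\sigma_k(A-z)} + \delta_{-\sigma_k(A-z)}\right), \]

where $\sigma_k(A-z)$, $1 \leq k \leq n$, denote the singular values of $A - z$.

Let $\mathbb{C}_+ = \{z \in \mathbb{C} : \Im(z) > 0\}$ and consider the quaternionic-type set

\[ \mathbb{H}_+ = \left\{ U = \begin{pmatrix} \eta & z \\ \bar z & \eta \end{pmatrix},\ z \in \mathbb{C},\ \eta \in \mathbb{C}_+ \right\} \subset \mathcal{M}_2(\mathbb{C}). \]

For $z \in \mathbb{C}$, $\eta \in \mathbb{C}_+$ and $1 \leq i,j \leq n$ integers, we define the elements of $\mathbb{H}_+$ and $\mathcal{M}_2(\mathbb{C})$ respectively,

\[ U(z,\eta) = \begin{pmatrix} \eta & z \\ \bar z & \eta \end{pmatrix} \quad\text{and}\quad B_{ij} = \begin{pmatrix} 0 & A_{ij} \\ \bar A_{ji} & 0 \end{pmatrix}. \]

We define the matrix in $\mathcal{M}_n(\mathcal{M}_2(\mathbb{C})) \simeq \mathcal{M}_{2n}(\mathbb{C})$, $B = (B_{ij})_{1\leq i,j\leq n}$. Since $B_{ij}^* = B_{ji}$, as an
element of $\mathcal{M}_{2n}(\mathbb{C})$, $B$ is a Hermitian matrix. Graphically, the matrix $A$ can be identified with
an oriented graph on the vertex set $\{1,\ldots,n\}$ with weight on the oriented edge $(i,j)$ equal to
$A_{ij}$. Then, the matrix $B$ can be thought of as the bipartization of the matrix $A$, that is a
non-oriented graph on the vertex set $\{1,-1,\ldots,n,-n\}$: for all integers $1 \leq i,j \leq n$ the weight on
the non-oriented edge $\{i,-j\}$ is $A_{ij}$, and there is no edge between $i$ and $j$ or $-i$ and $-j$.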
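
The bipartization can be sketched in a few lines. The code below assumes the block convention $B_{ij} = \begin{pmatrix} 0 & A_{ij} \\ \bar A_{ji} & 0 \end{pmatrix}$ and stores the vertex $i$ at index $2i$ and its copy $-i$ at index $2i+1$ (zero-based); the helper names `bipartize` and `is_hermitian` are illustrative only.

```python
def bipartize(A):
    """Return the 2n x 2n matrix B with 2x2 blocks B_ij = [[0, A_ij], [conj(A_ji), 0]]."""
    n = len(A)
    B = [[0j] * (2 * n) for _ in range(2 * n)]
    for i in range(n):
        for j in range(n):
            B[2 * i][2 * j + 1] = complex(A[i][j])               # edge {i, -j} carries A_ij
            B[2 * i + 1][2 * j] = complex(A[j][i]).conjugate()   # conjugate entry, block (i, j)
    return B


def is_hermitian(M, tol=1e-12):
    """Check M = M^* entrywise up to a tolerance."""
    n = len(M)
    return all(abs(M[r][c] - M[c][r].conjugate()) <= tol
               for r in range(n) for c in range(n))


A = [[1 + 2j, 3j], [4.0, -1j]]   # an arbitrary non-Hermitian example
B = bipartize(A)
```

Although `A` is not Hermitian, `B` is, and its entries vanish between two indices of the same parity, which is the bipartite structure described above.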

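For $U \in \mathbb{H}_+$, let $U \otimes I_n \in \mathcal{M}_n(\mathcal{M}_2(\mathbb{C}))$ be the matrix given by $(U \otimes I_n)_{ij} = \delta_{ij}U$, $1 \leq i,j \leq n$.
The resolvent matrix is defined in $\mathcal{M}_n(\mathcal{M}_2(\mathbb{C}))$ by

\[ R(U) = (B - U \otimes I_n)^{-1}, \]

so that for all $1 \leq i,j \leq n$, $R(U)_{ij} \in \mathcal{M}_2(\mathbb{C})$. For $1 \leq k \leq n$, we write, with $U = U(z,\eta)$,

\[ R(U)_{kk} = \begin{pmatrix} a_k(z,\eta) & b_k(z,\eta) \\ b'_k(z,\eta) & c_k(z,\eta) \end{pmatrix}. \tag{2.1} \]

The moduli of the entries of the matrix $R(U)_{kk}$ are bounded by $(\Im(\eta))^{-1}$ (see the forthcoming
lemma 2.2).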

As an element of $\mathcal{M}_{2n}(\mathbb{C})$, $R$ is the usual resolvent of the matrix

\[ B(z) = B - U(z,0) \otimes I_n. \]

Indeed, with $U = U(z,\eta)$,

\[ R(U) = (B(z) - \eta I_{2n})^{-1}. \tag{2.2} \]

In the next theorem, we shall check that the eigenvalues of $B(z)$ are $\pm\sigma_k(A - z)$, $1 \leq k \leq n$,
and consequently

\[ \mu_{B(z)} = \check\nu_{A-z}. \tag{2.3} \]

It will follow that the spectral measures $\mu_A$ and $\check\nu_{A-z}$ can be easily recovered from the resolvent
matrix. Recall that the Cauchy-Stieltjes transform of a measure $\nu$ on $\mathbb{R}$ is defined, for $\eta \in \mathbb{C}_+$, as

\[ m_\nu(\eta) = \int_{\mathbb{R}} \frac{1}{x - \eta}\,\nu(dx). \]
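
As a quick sanity check of this definition (a sketch under the convention $m_\nu(\eta) = \int (x-\eta)^{-1}\nu(dx)$), the transform of a symmetrized empirical measure $\frac{1}{2n}\sum_k(\delta_{\sigma_k} + \delta_{-\sigma_k})$ can be evaluated directly. The function name `stieltjes_symmetrized` is hypothetical.

```python
def stieltjes_symmetrized(sigmas, eta):
    """Cauchy-Stieltjes transform at eta of (1/2n) * sum_k (delta_{s_k} + delta_{-s_k})."""
    n = len(sigmas)
    return sum(1 / (s - eta) + 1 / (-s - eta) for s in sigmas) / (2 * n)


m = stieltjes_symmetrized([0.5, 1.0, 2.0], 0.3 + 0.5j)   # eta in the upper half-plane
m_imag_axis = stieltjes_symmetrized([1.0, 2.0], 1j)      # eta = i, on the imaginary axis
```

In this example `m` has positive imaginary part and modulus at most $1/\Im(\eta)$, and `m_imag_axis` is purely imaginary, since the measure is symmetric.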

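The Cauchy-Stieltjes transform characterizes the measure. For a probability measure $\mu$ on $\mathbb{C}$, it is
possible to define a Cauchy-Stieltjes-like transform on quaternions, by setting for $U \in \mathbb{H}_+$,

\[ M_\mu(U) = \int_{\mathbb{C}} \left( \begin{pmatrix} 0 & \lambda \\ \bar\lambda & 0 \end{pmatrix} - U \right)^{-1} \mu(d\lambda). \]

This transform characterizes the measure: in $\mathcal{D}'(\mathbb{C})$, $\lim_{t\downarrow 0} (\partial M_\mu(U(z,it)))_{12} = -\pi\mu$. If $A$ is
normal, i.e. if $A^*A = AA^*$, then it can be checked that $R(U)_{kk} \in \mathbb{H}_+$ and

\[ \frac{1}{n}\sum_{k=1}^n R(U)_{kk} = M_{\mu_A}(U). \]

However, if $A$ is not normal, the above formula fails to hold and the next theorem explains how
to recover $\mu_A$ from the resolvent anyway.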

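Theorem 2.1 (From resolvent to spectral measure). Let $U = U(z,\eta) \in \mathbb{H}_+$, and $a_k, b_k, b'_k, c_k$ be
as in (2.1). Then (2.3) holds,

\[ m_{\check\nu_{A-z}}(\eta) = \frac{1}{2n}\sum_{k=1}^n \left(a_k(z,\eta) + c_k(z,\eta)\right), \]

and, in $\mathcal{D}'(\mathbb{C})$,

\[ \mu_A = -\frac{1}{\pi n}\sum_{k=1}^n \partial b_k(\cdot,0) = \lim_{t\downarrow 0} -\frac{1}{\pi n}\sum_{k=1}^n \partial b_k(\cdot,it). \]

In particular, if $A$ is a random matrix with exchangeable entries, then by linearity we get

\[ m_{\mathbb{E}\check\nu_{A-z}}(\eta) = \mathbb{E}\,a_1(z,\eta), \]

and, in $\mathcal{D}'(\mathbb{C})$,

\[ \mathbb{E}\mu_A = -\frac{1}{\pi}\,\partial\,\mathbb{E}\,b_1(\cdot,0) = \lim_{t\downarrow 0} -\frac{1}{\pi}\,\partial\,\mathbb{E}\,b_1(\cdot,it). \]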

Proof of theorem 2.1. Through a permutation of the entries, the matrix $B(z)$ is similar to

\[ \begin{pmatrix} 0 & A - z \\ (A - z)^* & 0 \end{pmatrix} \]

whose eigenvalues are easily seen to be $\pm\sigma_k(A - z)$, $1 \leq k \leq n$. We get

\[ \mathrm{tr}\,R = \sum_{k=1}^n (a_k + c_k) = \sum_{k=1}^n (\sigma_k(A-z) - \eta)^{-1} + \sum_{k=1}^n (-\sigma_k(A-z) - \eta)^{-1}. \]

The first statement and (2.3) follow. Also, from (A.5) in the Appendix, for $z \notin \mathrm{supp}(\mu_A)$,

\[ U_{\mu_A}(z) = \int \ln|x|\,\check\nu_{A-z}(dx) = \frac{1}{2n}\ln|\det B(z)|, \tag{2.4} \]

where $U_\mu$ is the logarithmic potential of a measure $\mu$ on $\mathbb{C}$, see (A.1). Recall that the differential
of $X \mapsto \det(X)$ at point $X$ (invertible) in the direction $Y$ is $\mathrm{tr}(X^{-1}Y)\det(X)$ (this is sometimes
referred to as the Jacobi formula). The sign of $\det B(z)$ is $(-1)^n$. We deduce that in $\mathcal{D}'(\mathbb{C})$,
$\bar\partial \ln|\det B(z)| = \mathrm{tr}(B(z)^{-1}\,\bar\partial B(z))$.
With $R_{kk} = R(U(z,0))_{kk} = (B(z)^{-1})_{kk}$, we get from $\bar\partial z = 0$, $\bar\partial \bar z = 1$,

\[ \bar\partial \ln|\det B(z)| = -\sum_{k=1}^n \mathrm{tr}\left( R_{kk} \begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix} \right) = -\sum_{k=1}^n b_k(z,0). \]

Now from Equation (A.2), in $\mathcal{D}'(\mathbb{C})$, using $\Delta = 4\partial\bar\partial$,

\[ 2\pi\mu_A = \Delta U_{\mu_A} = \Delta\,\frac{1}{2n}\ln|\det B(z)| = -\frac{2}{n}\sum_{k=1}^n \partial b_k. \]

To get the limit as $t \downarrow 0$, we note that for real $t > 0$,

\[ \int \ln|x - it|\,\mu_{B(z)}(dx) = \frac{1}{2n}\ln|\det(B(z) - it)|. \]

Note that $\det(B(z) - it)$ is real and its sign is $(-1)^n$. As $t \downarrow 0$, the left hand side of the above
identity converges in $\mathcal{D}'(\mathbb{C})$ to $U_{\mu_A}$. Taking the Laplacian, and arguing as above, we get

\[ \Delta \int \ln|x - it|\,\mu_{B(z)}(dx) = -\frac{2}{n}\sum_{k=1}^n \partial b_k(z,it). \tag{2.5} \]

The conclusion follows. □

Note that even if $-\sum_k \partial b_k$ is a measure on $\mathbb{C}$, for each $1 \leq k \leq n$, $-\partial b_k$ is not in general a
measure on $\mathbb{C}$ (lack of positivity; this can be checked on $2 \times 2$ matrices).

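2.2. Bipartization of an operator. We shall generalize the above finite dimensional construction.
Let $V$ be a countable set and let $\ell^2(V)$ denote the Hilbert space defined by the scalar
product

\[ \langle\phi,\psi\rangle := \sum_{u\in V} \bar\phi_u\psi_u, \qquad \phi_u = \langle\delta_u,\phi\rangle, \]

where $\delta_u$ is the unit vector supported on $u \in V$. Let $\mathcal{D}(V)$ denote the dense subset of $\ell^2(V)$ of
vectors with finite support. Let $(w_{uv})_{u,v\in V}$ be a collection of complex numbers such that for all $u \in V$,

\[ \sum_{v\in V} |w_{uv}|^2 + |w_{vu}|^2 < \infty. \]

We may then define a linear operator $A$ on $\mathcal{D}(V)$, by the formula,

\[ \langle\delta_u, A\delta_v\rangle = w_{uv}. \tag{2.6} \]

Let $\hat V$ be a set in bijection with $V$, the image of $v \in V$ being denoted by $\hat v \in \hat V$. We set
$V^b = V \cup \hat V$ and define the symmetric operator $B$ on $\mathcal{D}(V^b)$, by the formulas,

\[ \langle\delta_u, B\delta_{\hat v}\rangle = \overline{\langle\delta_{\hat v}, B\delta_u\rangle} = w_{uv}, \qquad \langle\delta_u, B\delta_v\rangle = \langle\delta_{\hat u}, B\delta_{\hat v}\rangle = 0. \tag{2.7} \]

In other words, if $\Pi_u : \ell^2(V^b) \to \mathbb{C}^2$ denotes the orthogonal projection on $(u,\hat u)$,

\[ \Pi_u B \Pi_v^* = \begin{pmatrix} 0 & w_{uv} \\ \bar w_{vu} & 0 \end{pmatrix}. \]

For $z \in \mathbb{C}$, we also define on $\mathcal{D}(V^b)$ the symmetric operator $B(z)$: for all $u, v$ in $V$,

\[ \langle\delta_u, B(z)\delta_{\hat v}\rangle = \overline{\langle\delta_{\hat v}, B(z)\delta_u\rangle} = w_{uv} - z\,\mathbf{1}(u = v), \]
\[ \langle\delta_u, B(z)\delta_v\rangle = \langle\delta_{\hat u}, B(z)\delta_{\hat v}\rangle = 0. \]

Hence, if we identify $V^b$ with $\{1,2\} \times V$, we have

\[ B(z) = B - U(z,0) \otimes I_V. \tag{2.8} \]

The operator $B(z)$ is symmetric and it has a closure on a domain $D(B) \subset \ell^2(V^b)$. We also denote
by $B(z)$ the closure of $B(z)$. If $B$ is self-adjoint then $B(z)$ is also self-adjoint (recall that the
sum of a bounded self-adjoint operator and a self-adjoint operator is also a self-adjoint operator).
Recall also that the spectrum of a self-adjoint operator is real. For all $U = U(z,\eta) \in \mathbb{H}_+$,
$B(z) - \eta I_{V^b} = B - U(z,\eta) \otimes I_V$ is invertible with bounded inverse and the resolvent operator is
then well defined by

\[ R(U) = (B(z) - \eta I_{V^b})^{-1}. \]

We may then define

\[ R(U)_{vv} := \Pi_v R(U) \Pi_v^* = \begin{pmatrix} a_v(z,\eta) & b_v(z,\eta) \\ b'_v(z,\eta) & c_v(z,\eta) \end{pmatrix}. \]

In the sequel, we shall use some properties of resolvent operators.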



Lemma 2.2 (Properties of resolvent). Let $B$ be the above bipartized operator. Assume that $B$ is
self-adjoint and let $U = U(z,\eta) \in \mathbb{H}_+$, $v \in V$. Then, $a_v, c_v \in \mathbb{C}_+$, for each $z \in \mathbb{C}$, the functions
$a_v(z,\cdot)$, $b_v(z,\cdot)$, $b'_v(z,\cdot)$, $c_v(z,\cdot)$ are analytic on $\mathbb{C}_+$, and

\[ |a_v| \leq (\Im(\eta))^{-1}, \quad |c_v| \leq (\Im(\eta))^{-1}, \quad |b_v| \leq (2\Im(\eta))^{-1} \quad\text{and}\quad |b'_v| \leq (2\Im(\eta))^{-1}. \]

Moreover, if $\eta \in i\mathbb{R}_+$, then $a_v$ and $c_v$ are pure imaginary and $b'_v = \bar b_v$.

Proof. For a proof of the first statements refer e.g. to Reed and Simon [42]. For the last statement
concerning $\eta \in i\mathbb{R}_+$, we define the skeleton of $B(z)$ as the graph on $V^b$ obtained by putting an
edge between two vertices $u, v$ in $V^b$, if $\langle\delta_u, B(z)\delta_v\rangle \neq 0$. Then since there is no edge between two
vertices of $V$ or $\hat V$, the skeleton of $B(z)$ is a bipartite graph.

Assume first that $B(z)$ is bounded: for all $u \in V^b$, $\|B(z)\delta_u\| \leq C$. Then for $|\eta| > C$, the series
expansion of the resolvent gives

\[ R(U) = -\sum_{n\geq 0} \frac{B(z)^n}{\eta^{n+1}}. \]

However since the skeleton is a bipartite graph, all cycles have an even length. It implies that for
$n$ odd, $\langle\delta_v, B(z)^n\delta_v\rangle = 0$. Applied first to $v \in V$, we deduce that for $|\eta| > C$, $a(z,-\bar\eta) = -\bar a(z,\eta)$
and then applied to $\hat v$, we get $c(z,-\bar\eta) = -\bar c(z,\eta)$. We may then extend to $\mathbb{C}_+$ this last identity
by analyticity. For $\eta = it \in i\mathbb{R}_+$, we deduce that $a_v$ and $c_v$ are pure imaginary. Similarly, since
the skeleton is a bipartite graph, a path from a vertex $v \in V$ to a vertex $\hat u \in \hat V$ must be of odd
length. We get for $|\eta| > C$

\[ b'_v(z,-\bar\eta) = -\sum_{n\geq 0} \frac{\langle\delta_{\hat v}, B(z)^{2n+1}\delta_v\rangle}{\bar\eta^{2n+2}} = -\sum_{n\geq 0} \frac{\overline{\langle\delta_v, B(z)^{2n+1}\delta_{\hat v}\rangle}}{\bar\eta^{2n+2}} = \overline{\langle\delta_v, R(U)\delta_{\hat v}\rangle} = \bar b_v(z,\eta), \]

where we have used the symmetry of $B(z)$. It follows that $b'_v(z,-\bar\eta) = \bar b_v(z,\eta)$. If $B(z)$ is not
bounded, then $B(z)$ is the limit of a sequence of bounded operators and we conclude by invoking
Theorem VIII.25(a) in [42]. □

2.3. Operator on a tree. We keep the setting of the above paragraph and consider a (non-oriented)
tree $T = (V,E)$ on the vertices $V$ with edge set $E$ (recall that a tree is a connected graph
without cycles). For ease of notation, we note $u \sim v$ if $\{u,v\} \in E$. We assume that if $\{u,v\} \notin E$
then $w_{uv} = w_{vu} = 0$. In particular $w_{vv} = 0$ for all $v \in V$. We continue to consider the operator $A$
defined by (2.6).

In the special case when $w_{uv} = \bar w_{vu}$ for all $u, v$ in $V$, the operator $A$ is symmetric and we first
look for sufficient conditions for $A$ to be essentially self-adjoint.

Lemma 2.3 (Criterion of self-adjointness). Let $\kappa > 0$ and $T = (V,E)$ be a tree. Assume that for
all $u,v \in V$, $w_{uv} = \bar w_{vu}$ and that if $\{u,v\} \notin E$ then $w_{uv} = w_{vu} = 0$. Assume also that there exists
a sequence of connected finite subsets $(S_n)_{n\geq 1}$ in $V$, such that $S_n \subset S_{n+1}$, $\cup_n S_n = V$, and for
every $n$ and $v \in S_n$,

\[ \sum_{u \notin S_n :\, u \sim v} |w_{uv}|^2 \leq \kappa. \]

Then $A$ is essentially self-adjoint.

For a proof, see [11, Lemma A.3]. The above lemma has an interesting corollary for the bipartized
operator $B$ of $A$ defined by (2.7).






CHARLES BORDENAVE, PIETRO CAPUTO, AND DJALIL CHAFAI 



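Corollary 2.4 (Criterion of self-adjointness of bipartized operator). Let $\kappa > 0$ and $T = (V,E)$ be
a tree. Assume that if $\{u,v\} \notin E$ then $w_{uv} = w_{vu} = 0$. Assume also that there exists a sequence
of connected finite subsets $(S_n)_{n\geq 1}$ in $V$, such that $S_n \subset S_{n+1}$, $\cup_n S_n = V$, and for every $n$ and
$v \in S_n$,

\[ \sum_{u \notin S_n :\, u \sim v} \left(|w_{uv}|^2 + |w_{vu}|^2\right) \leq \kappa. \]

Then for all $z \in \mathbb{C}$, $B(z)$ is self-adjoint.

Proof. From (2.8), it is sufficient to check that $B$ is self-adjoint. Let $\varnothing \in V$ be a distinguished
vertex; we define two disjoint trees $G_\varnothing = (V_\varnothing, E_\varnothing)$ and $\hat G_\varnothing = (\hat V_\varnothing, \hat E_\varnothing)$ on a partition $(V_\varnothing, \hat V_\varnothing)$
of $V^b$ as follows. The trees $G_\varnothing$ and $\hat G_\varnothing$ are the unique trees such that $\varnothing \in V_\varnothing$, $\hat\varnothing \in \hat V_\varnothing$ and that
satisfy the following properties

(i) if $\{u,v\} \in E$ and $u$ in $V_\varnothing$ (or $\hat V_\varnothing$) then $\hat v \in V_\varnothing$ (or $\hat V_\varnothing$) and $\{u,\hat v\} \in E_\varnothing$ (or $\hat E_\varnothing$),
(ii) if $\{u,v\} \in E$ and $\hat u$ in $V_\varnothing$ (or $\hat V_\varnothing$) then $v \in V_\varnothing$ (or $\hat V_\varnothing$) and $\{\hat u,v\} \in E_\varnothing$ (or $\hat E_\varnothing$).

We note that by construction if $u \in V_\varnothing$ and $v \in \hat V_\varnothing$ then $\langle\delta_u, B\delta_v\rangle = 0$. It follows that the
operator $B$ decomposes orthogonally into two operators $B_\varnothing$ and $\hat B_\varnothing$ on domains in $\ell^2(V_\varnothing)$ and
$\ell^2(\hat V_\varnothing)$ respectively: $B = B_\varnothing \oplus \hat B_\varnothing$. We may then safely apply lemma 2.3 to $B_\varnothing$ and $\hat B_\varnothing$. □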

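When the operator $B$ is self-adjoint, the resolvent operator has a nice recursive expression due
to the tree structure. Let $\varnothing \in V$ be a distinguished vertex of $V$ (in graph language, we root the
tree $T$ at $\varnothing$). For each $v \in V \setminus \{\varnothing\}$, we define $V_v \subset V$ as the set of vertices whose unique path to
the root $\varnothing$ contains $v$. We define $T_v = (V_v, E_v)$ as the subtree of $T$ spanned by $V_v$. We finally
consider $A_v$, the projection of $A$ on $V_v$, and $B_v$ the bipartized operator of $A_v$. The skeleton of $A_v$
is contained in $T_v$. Finally, we note that if $B$ is self-adjoint then so is $B_v(z)$ for every $z \in \mathbb{C}$. The
next lemma can be interpreted as a Schur complement formula on trees.

Lemma 2.5 (Resolvent on a tree). Assume that $B$ is self-adjoint and let $U = U(z,\eta) \in \mathbb{H}_+$. Then

\[ R(U)_{\varnothing\varnothing} = -\left( U + \sum_{v \sim \varnothing} \begin{pmatrix} 0 & w_{\varnothing v} \\ \bar w_{v\varnothing} & 0 \end{pmatrix} \tilde R(U)_{vv} \begin{pmatrix} 0 & w_{v\varnothing} \\ \bar w_{\varnothing v} & 0 \end{pmatrix} \right)^{-1}, \]

where $\tilde R(U)_{vv} = \Pi_v R_{B_v}(U) \Pi_v^*$ and $R_{B_v}(U) = (B_v(z) - \eta)^{-1}$ is the resolvent operator of $B_v$.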
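Proof. Define the operator $C$ on $\mathcal{D}(V^b)$ by its matrix elements

\[ C_\varnothing := \Pi_\varnothing C \Pi_\varnothing^* = -U(z,0), \qquad C_v := \Pi_\varnothing C \Pi_v^* = (\Pi_v C \Pi_\varnothing^*)^* = \begin{pmatrix} 0 & w_{\varnothing v} \\ \bar w_{v\varnothing} & 0 \end{pmatrix} \]

for all $v \in V$ such that $v \sim \varnothing$, and $\Pi_u C \Pi_v^* = 0$ otherwise. The operator $C$ is symmetric and
bounded. Its extension to $\ell^2(V^b)$ is thus self-adjoint (also denoted by $C$). In this way, we have
from $V = \{\varnothing\} \cup \bigcup_{v\sim\varnothing} V_v$,

\[ B(z) = C + \tilde B \quad\text{with}\quad \tilde B = \bigoplus_{v \sim \varnothing} B_v(z). \]

We shall write $\tilde R(U) = (\tilde B - \eta I)^{-1}$ for the associated resolvent of $\tilde B$. From the resolvent identity,
these operators satisfy

\[ \tilde R(U)\, C\, R(U) = \tilde R(U) - R(U). \tag{2.9} \]

Set $\tilde R_{uv} = \Pi_u \tilde R(U) \Pi_v^*$ and $R_{uv} = \Pi_u R(U) \Pi_v^*$. Observe that $\tilde R_{\varnothing\varnothing} = -\eta^{-1} I_2$. Also the direct sum
decomposition $V = \{\varnothing\} \cup \bigcup_{v\sim\varnothing} V_v$ implies $\tilde R_{vv} = \Pi_v R_{B_v}(U) \Pi_v^*$ and $\tilde R_{uv} = 0$ for every $u \neq v$ with
$u \sim \varnothing$, $v \sim \varnothing$. Similarly we have that $\tilde R_{\varnothing v} = 0 = \tilde R_{v\varnothing}$ for every $v \in V \setminus \{\varnothing\}$. Using the identity
$\sum_{v\in V} \Pi_v^*\Pi_v = I$, we get

\[ \Pi_\varnothing \tilde R(U)\, C\, R(U) \Pi_\varnothing^* = \tilde R_{\varnothing\varnothing} C_\varnothing R_{\varnothing\varnothing} + \sum_{v\sim\varnothing} \tilde R_{\varnothing\varnothing} C_v R_{v\varnothing} = \eta^{-1} U(z,0) R_{\varnothing\varnothing} - \eta^{-1} \sum_{v\sim\varnothing} C_v R_{v\varnothing}. \]

Composing the identity (2.9) on the left by $\Pi_v$ and on the right by $\Pi_\varnothing^*$, we obtain, for $v \sim \varnothing$,

\[ \tilde R_{vv} C_v^* R_{\varnothing\varnothing} = -R_{v\varnothing}. \]

We finally compose (2.9) on the left by $\Pi_\varnothing$ and on the right by $\Pi_\varnothing^*$,

\[ \eta^{-1} U(z,0) R_{\varnothing\varnothing} + \eta^{-1} \sum_{v\sim\varnothing} C_v \tilde R_{vv} C_v^* R_{\varnothing\varnothing} = -\eta^{-1} I_2 - R_{\varnothing\varnothing}, \]

or equivalently $\left(U(z,\eta) + \sum_{v\sim\varnothing} C_v \tilde R_{vv} C_v^*\right) R_{\varnothing\varnothing} = -I_2$. □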
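
The final identity $\left(U(z,\eta) + \sum_{v\sim\varnothing} C_v \tilde R_{vv} C_v^*\right) R_{\varnothing\varnothing} = -I_2$ can be checked numerically on the simplest tree, a star with root $\varnothing$ and $m$ leaves. The sketch below rests on several assumptions that are ours: the conventions $U(z,\eta) = \begin{pmatrix}\eta & z\\ \bar z & \eta\end{pmatrix}$ and $C_v = \begin{pmatrix}0 & w_{\varnothing v}\\ \bar w_{v\varnothing} & 0\end{pmatrix}$, the fact that for a leaf $\tilde R_{vv} = -U^{-1}$ (since $w_{vv} = 0$), and a hand-written Gauss-Jordan inverse. All function names are hypothetical.

```python
def mat_mul(X, Y):
    """Product of two dense matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def mat_inv(M):
    """Gauss-Jordan inverse with partial pivoting (small dense matrices only)."""
    n = len(M)
    aug = [[complex(x) for x in row] + [1 + 0j if i == j else 0j for j in range(n)]
           for i, row in enumerate(M)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[piv] = aug[piv], aug[col]
        p = aug[col][col]
        aug[col] = [x / p for x in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [x - f * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def U(z, eta):
    return [[eta, z], [z.conjugate(), eta]]


def root_block_direct(w_out, w_in, z, eta):
    """Root 2x2 block of (B - U (x) I)^{-1} for a star; w_out[v] = w_{root,v}, w_in[v] = w_{v,root}."""
    n = len(w_out) + 1
    M = [[0j] * (2 * n) for _ in range(2 * n)]
    for v in range(1, n):
        a, b = w_out[v - 1], w_in[v - 1]
        M[0][2 * v + 1] = a                  # block (root, v) = [[0, a], [conj(b), 0]]
        M[1][2 * v] = b.conjugate()
        M[2 * v][1] = b                      # block (v, root) = [[0, b], [conj(a), 0]]
        M[2 * v + 1][0] = a.conjugate()
    Uz = U(z, eta)
    for k in range(n):                       # subtract U on every diagonal block
        for r in range(2):
            for c in range(2):
                M[2 * k + r][2 * k + c] -= Uz[r][c]
    R = mat_inv(M)
    return [R[0][:2], R[1][:2]]


def root_block_recursion(w_out, w_in, z, eta):
    """-(U + sum_v C_v Rt_vv C_v^*)^{-1} with Rt_vv = -U^{-1} at each leaf."""
    Uz = U(z, eta)
    Rt = [[-x for x in row] for row in mat_inv(Uz)]
    S = [row[:] for row in Uz]
    for a, b in zip(w_out, w_in):
        C = [[0j, a], [b.conjugate(), 0j]]
        C_star = [[0j, b], [a.conjugate(), 0j]]
        T = mat_mul(mat_mul(C, Rt), C_star)
        S = [[S[r][c] + T[r][c] for c in range(2)] for r in range(2)]
    return [[-x for x in row] for row in mat_inv(S)]


w_out, w_in = [1 + 1j, 0.5 + 0j], [2 - 1j, -0.3j]   # arbitrary non-symmetric weights
z, eta = 0.2 + 0.1j, 0.1 + 0.7j
R_direct = root_block_direct(w_out, w_in, z, eta)
R_recursive = root_block_recursion(w_out, w_in, z, eta)
```

On a deeper finite tree the same recursion would be applied from the leaves upward, each $\tilde R_{vv}$ being computed from the blocks of the children of $v$; the star is only the depth-one case.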

2.4. Local operator convergence. In the next paragraphs, we are going to prove that the
sequence of random matrices $(A_n)$ converges to a limit random operator on an infinite tree. Let
us recall a notion of convergence that we have already used in [11].

Definition 2.6 (Local convergence). Suppose $(A_n)$ is a sequence of bounded operators on $\ell^2(V)$
and $A$ is a linear operator on $\ell^2(V)$ with domain $D(A) \supset \mathcal{D}(V)$. For any $u,v \in V$ we say that
$(A_n,u)$ converges locally to $(A,v)$, and write

\[ (A_n,u) \to (A,v), \]

if there exists a sequence of bijections $\sigma_n : V \to V$ such that $\sigma_n(v) = u$ and, for all $\phi \in \mathcal{D}(V)$,

\[ \sigma_n^{-1} A_n \sigma_n \phi \to A\phi, \]

in $\ell^2(V)$, as $n \to \infty$.

Assume in addition that $A$ is closed and $\mathcal{D}(V)$ is a core for $A$ (i.e. the closure of $A$ restricted to
$\mathcal{D}(V)$ equals $A$). Then, the local convergence is the standard strong convergence of operators up
to a re-indexing of $V$ which preserves a distinguished element. With a slight abuse of notation we
have used the same symbol $\sigma_n$ for the linear isometry $\sigma_n : \ell^2(V) \to \ell^2(V)$ induced in the obvious
way. As pointed out in [11], the point for using Definition 2.6 lies in the following theorem on
strong resolvent convergence.

Theorem 2.7 (From local convergence to resolvents). Assume that $(A_n)$ and $A$ satisfy the conditions
of Definition 2.6 and $(A_n,u) \to (A,v)$ for some $u,v \in V$. Let $B_n$ be the self-adjoint bipartized
operator of $A_n$. If the bipartized operator $B$ of $A$ is self-adjoint and $\mathcal{D}(V^b)$ is a core for $B$, then,
for all $U \in \mathbb{H}_+$,

\[ R_{B_n}(U)_{uu} \to R_B(U)_{vv}, \tag{2.10} \]

where $R_B(U)_{vv} = \Pi_v R_B(U) \Pi_v^*$ and $R_B(U) = (B(z) - \eta)^{-1}$ is the resolvent of $B(z)$.

Proof of theorem 2.7. It is a special case of Reed and Simon [42, Theorem VIII.25(a)]. Indeed,
we first fix $z \in \mathbb{C}$ and extend the bijection $\sigma_n$ to $V^b$ by the formula, for all $v \in V$, $\sigma_n(\hat v) = \widehat{\sigma_n(v)}$.
Then we define $\tilde B_n(z) = \sigma_n^{-1} B_n(z) \sigma_n$, so that $\tilde B_n(z)\phi \to B(z)\phi$ for all $\phi$ in a common
core of the self-adjoint operators $\tilde B_n(z)$, $B(z)$. This implies the strong resolvent convergence, i.e.
$(\tilde B_n(z) - \eta I)^{-1}\psi \to (B(z) - \eta I)^{-1}\psi$ for any $\eta \in \mathbb{C}_+$, $\psi \in \ell^2(V^b)$. We conclude by using the identities
$\Pi_v(\tilde B_n(z) - \eta I)^{-1}\delta_v = \Pi_u(B_n(z) - \eta I)^{-1}\delta_u$ and $\Pi_v(\tilde B_n(z) - \eta I)^{-1}\delta_{\hat v} = \Pi_u(B_n(z) - \eta I)^{-1}\delta_{\hat u}$. □

We shall apply the above theorem in cases where the operators $A_n$ and $A$ are random operators
on $\ell^2(V)$, which satisfy with probability one the conditions of theorem 2.7. In this case we say that
$(A_n,u) \to (A,v)$ in distribution if there exists a random bijection $\sigma_n$ as in Definition 2.6 such that
$\sigma_n^{-1} A_n \sigma_n \phi$ converges in distribution to $A\phi$, for all $\phi \in \mathcal{D}(V)$ (where a random vector $\psi_n \in \ell^2(V)$
converges in distribution to $\psi$ if $\lim_{n\to\infty} \mathbb{E}f(\psi_n) = \mathbb{E}f(\psi)$ for all bounded continuous functions
$f : \ell^2(V) \to \mathbb{R}$). Under these assumptions (2.10) becomes convergence in distribution of
(bounded) complex random variables. Note that in order to prove theorems 1.1 and 1.2 we will also
need almost-sure convergence statements.

2.5. Poisson Weighted Infinite Tree (PWIT). We now define an operator on an infinite rooted
tree with random edge-weights, the Poisson weighted infinite tree (PWIT) introduced by Aldous
[1], see also [3].

Let $\rho$ be a positive Radon measure on $\mathbb{R}$ such that $\rho(\mathbb{R}) = \infty$. PWIT($\rho$) is the random weighted
rooted tree defined as follows. The vertex set of the tree is identified with $\mathbb{N}^f := \cup_{k\in\mathbb{N}} \mathbb{N}^k$ by
indexing the root as $\mathbb{N}^0 = \varnothing$, the offsprings of the root as $\mathbb{N}$ and, more generally, the offsprings
of some $v \in \mathbb{N}^k$ as $(v1), (v2), \ldots \in \mathbb{N}^{k+1}$ (for short notation, we write $(v1)$ in place of $(v,1)$). In
this way the set of $v \in \mathbb{N}^n$ identifies the $n$-th generation. We then define $T$ as the tree on $\mathbb{N}^f$ with
(non-oriented) edges between the offsprings and their parents.

We denote by Be(1/2) the Bernoulli probability distribution $\frac{1}{2}\delta_0 + \frac{1}{2}\delta_1$. Now assign marks to the
edges of the tree $T$ according to a collection $\{\Xi_v\}_{v\in\mathbb{N}^f}$ of independent realizations of the Poisson
point process with intensity measure $\rho \otimes \mathrm{Be}(1/2)$ on $\mathbb{R} \times \{0,1\}$. Namely, starting from the root $\varnothing$,
let $\Xi_\varnothing = \{(y_1,\varepsilon_1), (y_2,\varepsilon_2), \ldots\}$ be ordered in such a way that $|y_1| \leq |y_2| \leq \cdots$, and assign the mark
$(y_i,\varepsilon_i)$ to the offspring of the root labeled $i$. Now, recursively, at each vertex $v$ of generation $k$,
assign the mark $(y_{vi},\varepsilon_{vi})$ to the offspring labeled $vi$, where $\Xi_v = \{(y_{v1},\varepsilon_{v1}), (y_{v2},\varepsilon_{v2}), \ldots\}$ satisfy
$|y_{v1}| \leq |y_{v2}| \leq \cdots$. The Bernoulli mark $\varepsilon_{vi}$ should be understood as an orientation of the edge
$\{v,vi\}$: if $\varepsilon_{vi} = 1$, the edge is oriented from $vi$ to $v$ and from $v$ to $vi$ otherwise.

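For a probability measure $\theta$ on $S^1$, we introduce the measure $\ell_\theta$ on $\mathbb{C}$, for all Borel $D$:

\[ \ell_\theta(D) = \int_{S^1}\int_0^\infty \mathbf{1}_{\{\omega r \in D\}}\,\theta(d\omega)\,dr. \tag{2.11} \]

Consider a realization of PWIT($2\ell_\theta$). We now define a random operator $A$ on $\mathcal{D}(\mathbb{N}^f)$ by the
formula, for all $v \in \mathbb{N}^f$ and $k \in \mathbb{N}$,

\[ \langle\delta_v, A\delta_{vk}\rangle = \varepsilon_{vk}\, y_{vk}^{-1/\alpha} \quad\text{and}\quad \langle\delta_{vk}, A\delta_v\rangle = (1 - \varepsilon_{vk})\, y_{vk}^{-1/\alpha} \tag{2.12} \]

and $\langle\delta_v, A\delta_u\rangle = 0$ otherwise. It is an operator as in §2.3. Indeed, if $u = vk$ is an offspring of $v$, we
set

\[ w_{vu} = \varepsilon_{vk}\, y_{vk}^{-1/\alpha} \quad\text{and}\quad w_{uv} = (1 - \varepsilon_{vk})\, y_{vk}^{-1/\alpha}, \tag{2.13} \]

otherwise, we set $w_{uv} = 0$. We may thus consider the bipartized operator $B$ of $A$.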
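
To make the construction concrete, here is a minimal sampling sketch for the marks and weights attached to the first offsprings of a single vertex of PWIT($2\ell_\theta$). It relies on assumptions that are not taken from the text: $\theta$ is chosen uniform on $S^1$, the mark $y = \omega r$ is stored through its radius $r$ and phase $\omega$, and $y^{-1/\alpha}$ is read as $\omega\, r^{-1/\alpha}$. Under (2.11), the radii then form a Poisson process of intensity 2 on $(0,\infty)$, sampled here through i.i.d. exponential gaps. The function name `sample_offspring_marks` is hypothetical.

```python
import cmath
import math
import random


def sample_offspring_marks(num, alpha, rng=random):
    """Return [(r, eps, w_down, w_up)] for the first `num` offsprings of one vertex.

    w_down plays the role of <delta_v, A delta_vk>, w_up that of <delta_vk, A delta_v>.
    """
    marks, r = [], 0.0
    for _ in range(num):
        r += rng.expovariate(2.0)                              # next radius, Poisson intensity 2
        omega = cmath.rect(1.0, 2 * math.pi * rng.random())    # phase, uniform theta (assumption)
        eps = 1 if rng.random() < 0.5 else 0                   # Be(1/2) orientation mark
        weight = omega * r ** (-1.0 / alpha)                   # assumed reading of y^{-1/alpha}
        marks.append((r, eps, eps * weight, (1 - eps) * weight))
    return marks


marks = sample_offspring_marks(5, 1.5, rng=random.Random(0))
```

Exactly one of the two oriented weights is nonzero on each edge, and the moduli of the weights decrease along the offsprings, so only the first few offsprings carry weights of order one.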

Proposition 2.8 (Self-adjointness of bipartized operator on PWIT). Let $A$ be the random operator
defined by (2.12). With probability one, for all $z \in \mathbb{C}$, $B(z)$ is self-adjoint.

We shall use Corollary 2.4. We start with a technical lemma proved in [11, Lemma A.4].

Lemma 2.9. Let $\kappa > 0$, $0 < \alpha < 2$ and let $0 < x_1 < x_2 < \cdots$ be a Poisson process of intensity 1
on $\mathbb{R}_+$. Define $\tau = \inf\{t \in \mathbb{N} : \sum_{k=t+1}^\infty x_k^{-2/\alpha} \leq \kappa\}$. Then $\mathbb{E}\tau$ is finite and goes to 0 as $\kappa$ goes to
infinity.

Proof of proposition 2.8. For $\kappa > 0$ and $v \in \mathbb{N}^f$, we define

\[ \tau_v = \inf\Big\{t \geq 0 : \sum_{k=t+1}^\infty |y_{vk}|^{-2/\alpha} \leq \kappa\Big\}. \]

The variables $(\tau_v)$ are i.i.d. and by lemma 2.9 there exists $\kappa > 0$ such that $\mathbb{E}\tau_v < 1$. We fix such
$\kappa$. Now, we put a green color to all vertices $v$ such that $\tau_v \geq 1$ and a red color otherwise. We
consider an exploration procedure starting from the root which stops at red vertices and goes on
at green vertices. More formally, define the sub-forest $T^g$ of $T$ where we put an edge between $v$
and $vk$ if $v$ is a green vertex and $1 \leq k \leq \tau_v$. Then, if the root $\varnothing$ is red, we set $S_1 = C^g(T) = \{\varnothing\}$.
Otherwise, the root is green, and we consider $T^g_\varnothing = (V^g_\varnothing, E^g_\varnothing)$ the subtree of $T^g$ that contains the
root. It is a Galton-Watson tree with offspring distribution $\tau_\varnothing$. Thanks to our choice of $\kappa$, $T^g_\varnothing$ is
almost surely finite. Consider $L^g_\varnothing$, the leaves of this tree (i.e. the set of vertices $v$ in $V^g_\varnothing$ such that
for all $1 \leq k \leq \tau_v$, $vk$ is red). We set $S_1 = V^g_\varnothing \cup \bigcup_{v\in L^g_\varnothing} \{1 \leq k \leq \tau_v : vk\}$. Clearly, the set $S_1$ satisfies
the condition of Lemma 2.3.

Now, we define the outer boundary of $\{\varnothing\}$ as $\partial_\tau\{\varnothing\} = \{1,\ldots,\tau_\varnothing\}$ and for $v = (i_1,\ldots,i_k) \in \mathbb{N}^f \setminus \{\varnothing\}$
we set $\partial_\tau\{v\} = \{(i_1,\ldots,i_{k-1},i_k+1)\} \cup \{(i_1,\ldots,i_k,1),\ldots,(i_1,\ldots,i_k,\tau_v)\}$. For a connected
set $S$, its outer boundary is

\[ \partial_\tau S = \Big(\bigcup_{v\in S} \partial_\tau\{v\}\Big) \setminus S. \]
Now, for each vertex $u_1,\ldots,u_k \in \partial_\tau S_1$, we repeat the above procedure to the rooted subtrees
$T_{u_1},\ldots,T_{u_k}$. We set $S_2 = S_1 \cup \bigcup_{1\leq i\leq k} C^g(T_{u_i})$. Iteratively, we may thus almost surely define an
increasing connected sequence $(S_n)$ of vertices with the properties required for Corollary 2.4. □

2.6. Local convergence to PWIT. We may now come back to the random matrix $A_n$ defined
by (1.4). We extend it as an operator on $\mathcal{D}(\mathbb{N}^f)$ by setting for $1 \leq i,j \leq n$, $\langle\delta_i, A\delta_j\rangle = A_{ij}$ and
otherwise, if either $i$ or $j$ is in $\mathbb{N}^f \setminus \{1,\ldots,n\}$, $\langle\delta_i, A\delta_j\rangle = 0$.
The aim of this paragraph is to prove the following theorem.

Theorem 2.10 (Local convergence to PWIT). Assume (H1). Let $A_n$ be as above and $A$ be the
operator associated to PWIT($2\ell_\theta$) defined by (2.12). Then in distribution $(A_n,1) \to (A,\varnothing)$.

Up to small differences, this theorem has already been proved in [11, Section 2]. We review here
the method of proof and stress the differences. The method relies on the local weak convergence,
a notion introduced by Benjamini and Schramm [5], Aldous and Steele [3], see also Aldous and
Lyons [2].

We define a network as a graph with weights on its edges taking values in some metric space.
Let $G_n$ be the complete network on $\{1,\ldots,n\}$ whose weight on edge $\{i,j\}$ equals $(\xi^n_{ij})$, for some
collection $(\xi^n_{ij})_{1\leq i\leq j\leq n}$ of i.i.d. complex random variables. We set $\xi^n_{ji} = \xi^n_{ij}$. We consider the
rooted network $(G_n,1)$ obtained by distinguishing the vertex labeled 1.

We follow Aldous [1, Section 3]. For every fixed realization of the marks $(\xi^n_{ij})$, and for any
$B,H\in\mathbb{N}$ such that $(B^{H+1}-1)/(B-1) \le n$, we define a finite rooted subnetwork $(G_n,1)^{B,H}$ of
$(G_n,1)$, whose vertex set coincides with a $B$-ary tree of depth $H$ with root at 1. To this end we
partially index the vertices of $(G_n,1)$ as elements in

$$ J_{B,H} = \cup_{\ell=0}^{H}\{1,\dots,B\}^{\ell} \subset \mathbb{N}^f, $$

the indexing being given by an injective map $\sigma_n$ from $J_{B,H}$ to $V_n := \{1,\dots,n\}$. We set $I_\varnothing = \{1\}$
and the index of the root 1 is $\sigma_n^{-1}(1) = \varnothing$. The vertex $v\in V_n\setminus I_\varnothing$ is given the index $(k) = \sigma_n^{-1}(v)$,
$1\le k\le B$, if $\xi^n_{1,v}$ has the $k$-th smallest absolute value among $\{\xi^n_{1,j},\ j\ne 1\}$, the marks of edges
emanating from the root 1. We break ties by using the lexicographic order. This defines the first
generation. Now let $I_1$ be the union of $I_\varnothing$ and the $B$ vertices that have been selected. If $H\ge 2$, we
repeat the indexing procedure for the vertex indexed by $(1)$ (the first child) on the set $V_n\setminus I_1$. We
obtain a new set $\{11,\dots,1B\}$ of vertices sorted by their weights as before (for short notation, we
concatenate the vector $(1,1)$ into $11$). Then we define $I_2$ as the union of $I_1$ and this new collection.
We repeat the procedure for $(2)$ on $V_n\setminus I_2$ and obtain a new set $\{21,\dots,2B\}$, and so on. When
we have constructed $\{B1,\dots,BB\}$, we have finished the second generation (depth 2) and we have
indexed $(B^3-1)/(B-1)$ vertices. The indexing procedure is then repeated until depth $H$ so that
$(B^{H+1}-1)/(B-1)$ vertices are sorted. Call this set of vertices $V_n^{B,H} = \sigma_n J_{B,H}$. The subnetwork
of $G_n$ generated by $V_n^{B,H}$ is denoted $(G_n,1)^{B,H}$ (it can be identified with the original network $G_n$
where any edge $e$ touching the complement of $V_n^{B,H}$ is given a mark $x_e = \infty$). In $(G_n,1)^{B,H}$, the
set $\{u1,\dots,uB\}$ is called the set of offspring of the vertex $u$. Note that while the vertex set has
been given a tree structure, $(G_n,1)^{B,H}$ is still a complete network on $V_n^{B,H}$. The next proposition
shows that it nevertheless converges to a tree (i.e. extra marks diverge to $\infty$) if the $\xi^n_{ij}$ satisfy a
suitable scaling assumption.
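
The sorting procedure above can be sketched in code. This is a minimal illustration under stated assumptions, not the authors' implementation: vertices are labeled 0, ..., n-1 (vertex 0 plays the role of the root 1), tree indices are Python tuples (the empty tuple stands for the root index), ties are broken by vertex label, and the function name `explore` is hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

Index = Tuple[int, ...]  # () is the root index; (1, 2) is the 2nd child of the 1st child


def explore(marks: Sequence[Sequence[complex]], B: int, H: int) -> Dict[Index, int]:
    """Build the injective map sigma from J_{B,H} to vertices {0, ..., n-1}.

    marks[i][j] is the mark on edge {i, j}. Each indexed vertex receives as
    children the B not-yet-indexed vertices whose marks from it have the
    smallest absolute values, generation by generation, up to depth H.
    """
    n = len(marks)
    if sum(B ** h for h in range(H + 1)) > n:
        raise ValueError("need (B^(H+1) - 1) / (B - 1) <= n")
    sigma: Dict[Index, int] = {(): 0}
    used = {0}
    generation: List[Index] = [()]
    for _ in range(H):
        next_generation: List[Index] = []
        for u in generation:  # parents are visited in lexicographic order
            parent = sigma[u]
            candidates = sorted(
                (v for v in range(n) if v not in used),
                key=lambda v: (abs(marks[parent][v]), v),  # assumed tie-break: vertex label
            )
            for k, v in enumerate(candidates[:B], start=1):
                sigma[u + (k,)] = v
                used.add(v)
                next_generation.append(u + (k,))
        generation = next_generation
    return sigma
```

For B = 2 and H = 2 the map indexes 1 + 2 + 4 = 7 vertices, in agreement with the count $(B^{H+1}-1)/(B-1)$.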

Let $\rho$ be a Radon measure on $\mathbb{C}$ and let $T$ be a realization of PWIT$(\rho)$ defined in §2.5. For
the moment, we remove the Bernoulli marks $(\varepsilon_v)_{v\in\mathbb{N}^f}$ and, for $v\in\mathbb{N}^f$ and $k\in\mathbb{N}$, we define the
weight on edge $\{v,vk\}$ to simply be $y_{vk}$. Then $(T,\varnothing)$ is a rooted network. We call $(T,\varnothing)^{B,H}$ the
finite random network obtained by the same sorting procedure. Namely, $(T,\varnothing)^{B,H}$ consists of the
subtree with vertices in $J_{B,H}$, with the marks inherited from the infinite tree. If an edge is not
present in $(T,\varnothing)^{B,H}$, we assign to it the mark $+\infty$.

We say that the sequence of random finite networks $(G_n,1)^{B,H}$ converges in distribution (as
$n\to\infty$) to the random finite network $(T,\varnothing)^{B,H}$ if the joint distributions of the marks converge
weakly. To make this precise we have to add the points $\{\pm\infty\}$ as possible values for each mark,
and continuous functions on the space of marks have to be understood as functions such that the
limit as any one of the marks diverges to $+\infty$ exists and coincides with the limit as the same mark
diverges to $-\infty$. We may define $\bar{\mathbb{C}} = \mathbb{C}\cup\{\pm\infty\}$. The next proposition generalizes [1, Section 3],
for a proof see [11, Proposition 2.6] (the proof there is stated for a measure $\rho$ on $\mathbb{R}$, the complex
case extends verbatim).

Proposition 2.11 (Local weak convergence to a tree). Let $(\xi^n_{ij})_{1\le i\le j\le n}$ be a collection of i.i.d.
random variables in $\bar{\mathbb{C}}$ and set $\xi^n_{ji} = \xi^n_{ij}$. Let $\rho$ be a Radon measure on $\mathbb{C}$ with no mass at 0 and
assume that

$$ n\,\mathbb{P}(\xi^n_{12}\in\cdot)\ \xrightarrow[n\to\infty]{w}\ \rho. \tag{2.14} $$

Let $G_n$ be the complete network on $\{1,\dots,n\}$ whose mark on edge $\{i,j\}$ equals $\xi^n_{ij}$, and $T$ a
realization of PWIT$(\rho)$. Then, for all integers $B,H$,

$$ (G_n,1)^{B,H}\ \xrightarrow[n\to\infty]{d}\ (T,\varnothing)^{B,H}. $$

Now, we shall extend the above statement to directed networks. More precisely, let $(\xi^n_{ij})_{1\le i,j\le n}$
be i.i.d. real random variables. We consider the complete graph $G_n$ on $V_n$ whose weight on edge
$\{i,j\}$ equals, if $i\le j$, $(\xi^n_{ij},\xi^n_{ji})\in\bar{\mathbb{R}}^2$. As above, we partially index the vertices of $(G_n,1)$ as
elements in $J_{B,H}$,
the indexing being given by an injective map $\sigma_n$ from $J_{B,H}$ to $V_n$ such that $\sigma_n^{-1}(1) = \varnothing$. The
difference with the above construction is that the vertex $v\in V_n\setminus I_\varnothing$ is given the index $(k) = \sigma_n^{-1}(v)$,
$1\le k\le B$, if $\min(|\xi^n_{1,v}|,|\xi^n_{v,1}|)$ has the $k$-th smallest value among $\{\min(|\xi^n_{1,j}|,|\xi^n_{j,1}|),\ j\ne 1\}$.

Similarly, let $(T,\varnothing)$ be the infinite random rooted network with distribution PWIT$(\rho)$. This
time we do not remove the Bernoulli marks $(\varepsilon_v)_{v\in\mathbb{N}^f}$ and define the weight on edge $\{v,vk\}$ as
$(y_{vk},\infty)$ if $\varepsilon_{vk} = 1$ and $(\infty,y_{vk})$ if $\varepsilon_{vk} = 0$. Again, we call $(T,\varnothing)^{B,H}$ the finite random network
obtained by the sorting procedure: $(T,\varnothing)^{B,H}$ consists of the subtree with vertices in $J_{B,H}$, with
the marks inherited from the infinite tree.

We apply Proposition 2.11 to the complete network $G_n$ with mark on edge $\{i,j\}$ equal, if $i\le j$,
to $\min(|\xi^n_{ij}|,|\xi^n_{ji}|)$. This network satisfies (2.14) with $2\rho$. We remark that if $u,v\in J_{B,H}$ then
from (2.14), $\max(|\xi^n_{\sigma_n(u),\sigma_n(v)}|,|\xi^n_{\sigma_n(v),\sigma_n(u)}|)$ diverges weakly to infinity. We also notice that, given
$(G_n,1)^{B,H}$, with equal probability $|\xi^n_{\sigma_n(u),\sigma_n(v)}|$ is larger or less than $|\xi^n_{\sigma_n(v),\sigma_n(u)}|$. We deduce the
following.
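
The factor 2 in $2\rho$ can be checked at the level of moduli. The following is only a sketch, under the assumptions that $i\ne j$ and that $t > 0$ is a radius with $\rho(\{|x| = t\}) = 0$; the notation $p_n(t)$ is introduced here for illustration and does not appear in the original.

```latex
\begin{align*}
p_n(t) &:= \mathbb{P}\big(|\xi^n_{12}|\le t\big),
  \qquad n\,p_n(t) \xrightarrow[n\to\infty]{} \rho(\{x : |x|\le t\}) \quad\text{by (2.14)},\\
\mathbb{P}\big(\min(|\xi^n_{ij}|,|\xi^n_{ji}|)\le t\big)
  &= 1-(1-p_n(t))^2 = 2p_n(t)-p_n(t)^2,\\
n\,\mathbb{P}\big(\min(|\xi^n_{ij}|,|\xi^n_{ji}|)\le t\big)
  &= 2n\,p_n(t) - \frac{(n\,p_n(t))^2}{n}
  \xrightarrow[n\to\infty]{} 2\rho(\{x : |x|\le t\}).
\end{align*}
```

The second line uses the independence of $\xi^n_{ij}$ and $\xi^n_{ji}$; the quadratic term vanishes in the limit because $n\,p_n(t)$ stays bounded.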

Corollary 2.12 (Local weak convergence to a tree). Let $\rho$ be a Radon measure on $\mathbb{C}$ with no mass
at 0. Let $(\xi^n_{ij})_{1\le i,j\le n}$ be a collection of i.i.d. random variables in $\bar{\mathbb{C}}$ such that (2.14) holds. Let $G_n$
be the complete network on $\{1,\dots,n\}$ whose mark on edge $\{i,j\}$ equals, if $i\le j$, $(\xi^n_{ij},\xi^n_{ji})$, and $T$
a realization of PWIT$(2\rho)$. Then, for all integers $B,H$,

$$ (G_n,1)^{B,H}\ \xrightarrow[n\to\infty]{d}\ (T,\varnothing)^{B,H}. $$

We may now prove Theorem 2.10.

Proof of Theorem 2.10. We argue as in the proof of Theorem 2.3(i) in [11, Section 2]. We first define
the weights $(\xi^n_{ij})_{i,j\in\mathbb{N}^f}$ as follows. For integers $1\le i,j\le n$, we set

with the convention that $\xi^n_{ij} = \infty$ if $X_{ij} = 0$. For this choice, by assumption (H1), (2.14) holds
with $\rho = \ell_\theta$ and $\ell_\theta$ as in (2.11). If $i$ or $j$ is in $\mathbb{N}^f\setminus\{1,\dots,n\}$, we set $\xi^n_{ij} = \infty$.

Let $G_n$ denote the complete network on $\{1,\dots,n\}$ with marks $(\xi^n_{ij},\xi^n_{ji})$ on edge $\{i,j\}$ if
$i\le j$. From Corollary 2.12, for all $B,H$, $(G_n,1)^{B,H}$ converges weakly to $(T,\varnothing)^{B,H}$, where $T$ has
distribution PWIT$(2\ell_\theta)$. Let $A$ be the random operator associated to $T$.

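Let $\sigma_n^{B,H}$ be the map $\sigma_n$ associated to the network $(G_n,1)^{B,H}$. The maps $\sigma_n$ are arbitrarily
extended to a bijection $\mathbb{N}^f\to\mathbb{N}^f$. From the Skorokhod Representation Theorem we may assume
that $(G_n,1)^{B,H}$ converges a.s. to $(T,\varnothing)^{B,H}$ for all $B,H$. Thus we may find sequences $B_n,H_n$
tending to infinity and a sequence of bijections $\tilde\sigma_n := \sigma_n^{B_n,H_n}$ such that $(B_n^{H_n+1}-1)/(B_n-1)\le n$
and such that for any pair $u,v\in\mathbb{N}^f$ the marks $\xi^n_{\tilde\sigma_n(u),\tilde\sigma_n(v)}$ converge a.s. to

$$ \begin{cases} y_{uk} & \text{if for some integer } k,\ v = uk \text{ and } \varepsilon_{uk} = 1,\\ y_{vk} & \text{if for some integer } k,\ u = vk \text{ and } \varepsilon_{vk} = 0,\\ \infty & \text{otherwise.} \end{cases} $$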




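It follows that a.s., for all $u,v\in\mathbb{N}^f$, $\langle\delta_u,\tilde\sigma_n^{-1}A_n\tilde\sigma_n\delta_v\rangle \to \langle\delta_u,A\delta_v\rangle$.
For any $v$, set $\psi_n^v := \tilde\sigma_n^{-1}A_n\tilde\sigma_n\delta_v$. To prove Theorem 2.10 it is sufficient to show that for any
$v$, $\psi_n^v \to A\delta_v$ in $\ell^2(\mathbb{N}^f)$ almost surely as $n$ goes to infinity, i.e.,

$$ \sum_u \big|\langle\delta_u,\psi_n^v\rangle - \langle\delta_u,A\delta_v\rangle\big|^2 \to 0. $$

From what precedes, we know that $\langle\delta_u,\psi_n^v\rangle \to \langle\delta_u,A\delta_v\rangle$ for every $u$. The claim follows if we have
(almost surely) uniform (in $n$) square-integrability of $(\langle\delta_u,\psi_n^v\rangle)_u$. This in turn follows from Lemma
2.4(i) and Lemma 2.7 in [11]. □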

2.7. Convergence of the resolvent matrix. Let $A_n$ and $A$ be as in Theorem 2.10. From
Proposition 2.8 we may almost surely define the resolvent $R$ of the bipartized random operator of
$A$. For $U = U(z,\eta)\in\mathbb{H}_+$, we set

$$ R(U)_{\varnothing\varnothing} = \begin{pmatrix} a(z,\eta) & b(z,\eta) \\ b'(z,\eta) & c(z,\eta) \end{pmatrix}. \tag{2.15} $$

We define similarly $R_n(U) = (B_n(z)-\eta)^{-1}$, the resolvent of $B_n$, the bipartized operator of $A_n$.

We set $R_n(U)_{11} = \Pi_1 R_n(U)\Pi_1^*$.

Theorem 2.13 (Convergence of the resolvent matrix). Let $A_n$ and $A$ be as in Theorem 2.10. For
all $U = U(z,\eta)\in\mathbb{H}_+$,

$$ R_n(U)_{11}\ \xrightarrow[n\to\infty]{d}\ R(U)_{\varnothing\varnothing}. $$

Proof of theorem \2.13[ We apply proposition [23 theorem 12.101 and the "in distribution" version of 
theorem □ 

2.8. Proof of Theorem 1.1. Again, we consider the sequence of random $n\times n$ matrices $(A_n)$
defined in the introduction by (1.4).

Theorem 2.14. For all $z\in\mathbb{C}_+$, almost surely the measure $\check\nu_{A_n-z}(dx)$ converges weakly to a
measure $\check\nu_{\alpha,z}(dx)$ whose Cauchy-Stieltjes transform is given, for $\eta\in\mathbb{C}_+$, by

$$ m_{\check\nu_{\alpha,z}}(\eta) = \mathbb{E}\,a(z,\eta), $$

where $a(z,\eta)$ was defined in (2.15).

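Proof. For every $z\in\mathbb{C}$, by Proposition 2.8 the operator $B(z)$ is a.s. self-adjoint. It implies that
there exists a.s. a measure on $\mathbb{R}$, $\nu_{\varnothing,z}$, called the spectral measure with vector $\delta_\varnothing$, such that for
all $\eta\in\mathbb{C}_+$,

$$ a(z,\eta) = \langle\delta_\varnothing,R(U)\delta_\varnothing\rangle = \int\frac{\nu_{\varnothing,z}(dx)}{x-\eta} = m_{\nu_{\varnothing,z}}(\eta). $$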

We define $R_n$ as the resolvent matrix of $B_n$, the bipartized matrix of $A_n$. For $U = U(z,\eta)\in\mathbb{H}_+$,
we write $R_n(U)_{kk} = \begin{pmatrix} a_k & b_k \\ b'_k & c_k \end{pmatrix}$. By Theorem 2.1

By Lemma 2.2, for $U\in\mathbb{H}_+$, the entries of the matrix $R_n(U)_{11}$ are bounded. It follows from Theorem
2.13 that for all $U\in\mathbb{H}_+$,

$$ \lim_{n\to\infty}\mathbb{E}R_n(U)_{11} = \mathbb{E}\begin{pmatrix} a & b \\ b' & c \end{pmatrix}, $$

where the limit matrix was defined in (2.15). Hence, for all $z\in\mathbb{C}_+$,

$$ \lim_{n\to\infty} m_{\mathbb{E}\check\nu_{A_n-z}}(\eta) = \mathbb{E}\,a(z,\eta). $$

We deduce that $\mathbb{E}\check\nu_{A_n-z}$ converges to the measure $\check\nu_{\alpha,z} = \mathbb{E}\nu_{\varnothing,z}$. This convergence can be improved
to almost sure by showing that the random measure $\check\nu_{A_n-z}$ concentrates around its mean. This
is done by applying the Borel-Cantelli Lemma and Lemma C.2 to the matrix $B_n(z)$ whose spectral
measure equals $\check\nu_{A_n-z}$, see (2.3). □

Theorem 1.1 is a corollary of the above theorem up to the fact that $\mathbb{E}\,a(z,\eta)$ does not depend on
the measure $\theta$ which appears in (H1). The latter will be a consequence of the forthcoming theorem

3. Convergence of the spectral measure

3.1. Tightness. In this paragraph, we prove that the counting probability measures of the eigenvalues
and singular values of the random matrices $(A_n)$ defined by (1.4) are a.s. tight. For ease of
notation, we will often write $A$ in place of $A_n$.

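Lemma 3.1 (Tightness). If (H1) holds, there exists $r > 0$ such that for all $z\in\mathbb{C}$, a.s.

$$ \limsup_{n\to\infty}\int_0^\infty t^r\,\nu_{A-zI}(dt) < \infty, \quad\text{and thus } (\nu_{A-zI})_{n\ge 1} \text{ is tight.} $$

Moreover, a.s.

$$ \limsup_{n\to\infty}\int_{\mathbb{C}} |z|^r\,\mu_A(dz) < \infty, \quad\text{and thus } (\mu_A)_{n\ge 1} \text{ is tight.} $$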

Proof. In both cases, the a.s. tightness follows from the moment bound and the Markov inequality.
The moment bound on $\mu_A$ follows from the statement on $\nu_A$ (take $z = 0$) by using the Weyl
inequality (B.6). It is therefore enough to establish the moment bound on $\nu_{A-zI}$ for every $z\in\mathbb{C}$. Let
us fix $z\in\mathbb{C}$ and $r > 0$. By definition of $\nu_{A-zI}$ we have

$$ \int_0^\infty t^r\,\nu_{A-zI}(dt) = \frac{1}{n}\sum_{k=1}^n s_k(A-zI)^r. $$

From (B.2) we have $s_k(A-zI)\le s_k(A)+|z|$ for every $1\le k\le n$, and one can then safely assume
that $z = 0$ for the proof. By using (B.7) we get for any $0\le r\le 2$,

$$ \int_0^\infty t^r\,\nu_A(dt) \le Z_n := \frac{1}{n}\sum_{i=1}^n Y_{n,i} \quad\text{where}\quad Y_{n,i} := \Big(\sum_{j=1}^n a_n^{-2}|X_{ij}|^2\Big)^{r/2}. $$

We need to show that $(Z_n)_{n\ge 1}$ is a.s. bounded. Assume for the moment that

$$ \sup_{n\ge 1}\mathbb{E}(Y_{n,1}^4) < \infty \tag{3.1} $$

for some choice of $r$. Since $Y_{n,1},\dots,Y_{n,n}$ are i.i.d. for every $n\ge 1$, we get from (3.1) that

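$$ \mathbb{E}\big((Z_n-\mathbb{E}Z_n)^4\big) \le 3n^{-4}\sum_{1\le i,j\le n}\mathbb{E}\big((Y_{n,i}-\mathbb{E}Y_{n,i})^2(Y_{n,j}-\mathbb{E}Y_{n,j})^2\big) = O(n^{-2}). $$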

Therefore, by the monotone convergence theorem, we get $\mathbb{E}\big(\sum_{n\ge 1}(Z_n-\mathbb{E}Z_n)^4\big) < \infty$, which gives
$\sum_{n\ge 1}(Z_n-\mathbb{E}Z_n)^4 < \infty$ a.s. and thus $Z_n-\mathbb{E}Z_n\to 0$ a.s. Now the sequence $(\mathbb{E}Z_n)_{n\ge 1} = (\mathbb{E}Y_{n,1})_{n\ge 1}$
is bounded by (3.1) and it follows that $(Z_n)_{n\ge 1}$ is a.s. bounded.
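
For completeness, the $O(n^{-2})$ fourth-moment bound used above can be checked as follows. This is a sketch that only relies on (3.1) and on the independence of the rows; the shorthand $W_i := Y_{n,i}-\mathbb{E}Y_{n,i}$ is introduced here and is not in the original.

```latex
\begin{align*}
\mathbb{E}\big((Z_n-\mathbb{E}Z_n)^4\big)
 &= n^{-4}\,\mathbb{E}\Big(\sum_{i=1}^n W_i\Big)^4
  = n^{-4}\Big(n\,\mathbb{E}W_1^4 + 3n(n-1)\big(\mathbb{E}W_1^2\big)^2\Big)\\
 &\le 3n^{-2}\,\mathbb{E}W_1^4
  \le 3n^{-2}\cdot 8\big(\mathbb{E}Y_{n,1}^4 + (\mathbb{E}Y_{n,1})^4\big)
  \le 48\,n^{-2}\,\mathbb{E}\big(Y_{n,1}^4\big).
\end{align*}
```

The first equality uses that every term containing an isolated centered factor $W_i$ has zero mean; the last two steps use $(a-b)^4\le 8(a^4+b^4)$ and the Jensen inequality.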

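It remains to show that (3.1) holds, say if $0 < 4r < \alpha$. To this end, let us define

$$ S_{n,a,b} := \sum_{j=1}^n a_n^{-2}|X_{1j}|^2\,\mathbf{1}_{\{a_n^{-2}|X_{1j}|^2\in[a,b)\}} \quad\text{for every } a < b. $$

Now $Y_{n,1}^4 = (S_{n,0,\infty})^{2r} = (S_{n,0,1}+S_{n,1,\infty})^{2r}$ and thus,

$$ \mathbb{E}(Y_{n,1}^4) \le 2^{2r-1}\big\{\mathbb{E}(S_{n,0,1}^{2r}) + \mathbb{E}(S_{n,1,\infty}^{2r})\big\}. \tag{3.2} $$

We have $\sup_{n\ge 1}\mathbb{E}(S_{n,0,1}^{2r}) < \infty$. Indeed, since $2r < 1$, by the Jensen inequality,

$$ \mathbb{E}(S_{n,0,1}^{2r}) \le (\mathbb{E}S_{n,0,1})^{2r} $$

and by Lemma C.1,

$$ \mathbb{E}S_{n,0,1} \sim_{n\to\infty} \alpha/(2-\alpha). $$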
For the second term of the right hand side of (3.2), we set

$$ M_n := \max_{1\le j\le n} a_n^{-1}|X_{1j}| \quad\text{and}\quad N_n := \#\{1\le j\le n \text{ s.t. } a_n^{-1}|X_{1j}| > 1\}. $$

From the Hölder inequality, if $1/p+1/q = 1$, we have

$$ \mathbb{E}(S_{n,1,\infty}^{2r}) \le \mathbb{E}\big(N_n^{2r}M_n^{4r}\big) \le \big(\mathbb{E}N_n^{2rp}\big)^{1/p}\big(\mathbb{E}M_n^{4rq}\big)^{1/q}. \tag{3.3} $$

Recall that $\mathbb{P}(|X_{12}| > a_n) = (1+o(1))/n \le 2/n$ for $n\gg 1$. By the union bound, for $n\gg 1$,

In particular, we have $\sup_{n\ge 1}\mathbb{E}N_n^{\eta} < \infty$ for any $\eta > 0$. Similarly, since the function $L$ is slowly
varying, for $n\gg 1$ and all $t\ge 1$, we have

P(M„ ^ i) nF{\Xi2\ > tan) = na-'^r"' L{ant) 2t"". 
It follows that if $\gamma < \alpha$, $\sup_{n\ge 1}\mathbb{E}M_n^{\gamma} < \infty$. Taking $p$ and $q$ so that $4rq < \alpha$, we thus conclude
from (3.3) that $\sup_{n\ge 1}\mathbb{E}(S_{n,1,\infty}^{2r}) < \infty$. □

3.2. Invertibility. In this paragraph, we find a lower bound for the smallest singular value of the
random matrix $A - zI$ where $A$ is defined by (1.4), in other words an upper bound on the operator
norm of the resolvent of $A$. Such lower bounds on the smallest singular value of random matrices
were developed in recent years by using Littlewood-Offord type problems, as in [48, 49] and [45].
The available results require moment assumptions which are not satisfied when the entries have
heavy tails. Here we circumvent the problem by requiring the bounded density hypothesis (H3).
The removal of this hypothesis can be done by adapting the Rudelson and Vershynin approach
already used by Götze and Tikhomirov [28].

Lemma 3.2 (Invertibility). If (H3) holds then for some $r > 0$, every $z\in\mathbb{C}$, a.s.

$$ \lim_{n\to\infty} n^r s_n(A-zI) = +\infty. $$

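Proof. For every $x,y\in\mathbb{C}^n$ and $S\subset\mathbb{C}^n$, we set $x\cdot y := x_1\overline{y_1}+\dots+x_n\overline{y_n}$ and $\|x\|_2 := \sqrt{x\cdot x}$ and
$\mathrm{dist}(x,S) := \min_{y\in S}\|x-y\|_2$. Let $R_1,\dots,R_n$ be the rows of $A - zI$ and set

$$ R_{-i} := \mathrm{span}\{R_j;\ j\ne i\} $$

for every $1\le i\le n$. From Lemma B.2 we have

$$ \min_{1\le i\le n}\mathrm{dist}(R_i,R_{-i}) \le \sqrt{n}\,s_n(A-zI) $$

and consequently, by the union bound, for any $u\ge 0$,

$$ \mathbb{P}\big(\sqrt{n}\,s_n(A-zI)\le u\big) \le n\max_{1\le i\le n}\mathbb{P}\big(\mathrm{dist}(R_i,R_{-i})\le u\big). $$

Let us fix $1\le i\le n$. Let $Y_i$ be a unit vector orthogonal to $R_{-i}$. Such a vector is not unique; we
just pick one. This defines a random variable on the unit sphere $\mathbb{S}^{n-1} = \{x\in\mathbb{C}^n : \|x\|_2 = 1\}$. By
the Cauchy-Schwarz inequality,

$$ |R_i\cdot Y_i| \le \|\pi_i(R_i)\|_2\,\|Y_i\|_2 = \mathrm{dist}(R_i,R_{-i}) $$

where $\pi_i(\cdot)$ is the orthogonal projection on the orthogonal complement of $R_{-i}$. Let $\nu_i$ be the
distribution of $Y_i$ on $\mathbb{S}^{n-1}$. Since $Y_i$ and $R_i$ are independent, for any $u\ge 0$,

$$ \mathbb{P}\big(\mathrm{dist}(R_i,R_{-i})\le u\big) \le \mathbb{P}\big(|R_i\cdot Y_i|\le u\big) = \int_{\mathbb{S}^{n-1}}\mathbb{P}\big(|R_i\cdot y|\le u\big)\,d\nu_i(y). $$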

Let us first consider the case where $X_{11}$ has a bounded density $\varphi$ on $\mathbb{C}$. Since $\|y\|_2 = 1$ there
exists an index $j_0\in\{1,\dots,n\}$ such that $y_{j_0}\ne 0$ with $|y_{j_0}|^{-1}\le\sqrt{n}$. The complex random
variable $R_i\cdot y$ is a sum of independent complex random variables and one of them is $a_n^{-1}X_{ij_0}\overline{y_{j_0}}$,
which is absolutely continuous with a density bounded above by $a_n^2\,n\,\|\varphi\|_\infty$. Consequently, by a
basic property of convolutions of probability measures, the complex random variable $R_i\cdot y$ is also
absolutely continuous with a density $\varphi_i$ bounded above by $a_n^2\,n\,\|\varphi\|_\infty$ and thus

$$ \mathbb{P}\big(|R_i\cdot y|\le u\big) = \int_{\mathbb{C}}\mathbf{1}_{\{|s|\le u\}}\varphi_i(s)\,ds \le \pi u^2 a_n^2\,n\,\|\varphi\|_\infty. $$

Therefore, for every $b > 0$,

$$ \mathbb{P}\big(s_n(A-zI)\le n^{-b}\big) = O\big(n^{3-2b}a_n^2\big) $$

where the $O$ does not depend on $z$. By taking $b$ large enough, the first Borel-Cantelli lemma implies
that there exists $r > 0$ such that a.s. for every $z\in\mathbb{C}$ and $n\gg 1$,

$$ s_n(A-zI)\ge n^{-r}. $$
It remains to consider the case where $X_{11}$ has a bounded density $\varphi$ on $\mathbb{R}$. As for the complex case, let
us fix $y\in\mathbb{S}^{n-1}$. Since $\|y\|_2 = 1$ there exists an index $j_0\in\{1,\dots,n\}$ such that $|y_{j_0}|^{-1}\le\sqrt{n}$. Also,
either $|\mathfrak{Re}(y_{j_0})|^{-1}\le\sqrt{2n}$ or $|\mathfrak{Im}(y_{j_0})|^{-1}\le\sqrt{2n}$. Assume for instance that $|\mathfrak{Re}(y_{j_0})|^{-1}\le\sqrt{2n}$.
We observe that for every $u\ge 0$,

$$ \mathbb{P}\big(|R_i\cdot y|\le u\big) \le \mathbb{P}\big(|\mathfrak{Re}(R_i\cdot y)|\le u\big). $$

The real random variable $\mathfrak{Re}(R_i\cdot y)$ is a sum of independent real random variables and one of them
is $a_n^{-1}X_{ij_0}\mathfrak{Re}(y_{j_0})$, which is absolutely continuous with a density bounded above by $a_n\sqrt{2n}\,\|\varphi\|_\infty$.
Consequently, by a basic property of convolutions of probability measures, the real random variable
$\mathfrak{Re}(R_i\cdot y)$ is also absolutely continuous with a density $\varphi_i$ bounded above by $a_n\sqrt{2n}\,\|\varphi\|_\infty$.
Therefore, we have for every $u\ge 0$,

$$ \mathbb{P}\big(|R_i\cdot y|\le u\big) \le \int_{-u}^{u}\varphi_i(s)\,ds \le 2u\,a_n\sqrt{2n}\,\|\varphi\|_\infty. $$

We skip the rest of the proof, which is identical to the complex case. □
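
The exponent appearing in the complex case can be traced explicitly. The following is a sketch: it takes $u = n^{1/2-b}$ in the union bound, uses a bound $a_n^2\,n\,\|\varphi\|_\infty$ on the density of $R_i\cdot y$, and assumes the growth $a_n = n^{1/\alpha+o(1)}$.

```latex
\begin{align*}
\mathbb{P}\big(s_n(A-zI)\le n^{-b}\big)
 &= \mathbb{P}\big(\sqrt{n}\,s_n(A-zI)\le n^{1/2-b}\big)\\
 &\le n\cdot\pi\,n^{1-2b}\cdot a_n^2\,n\,\|\varphi\|_\infty
  = \pi\,\|\varphi\|_\infty\,n^{3-2b}\,a_n^2 .
\end{align*}
```

Under the assumed growth of $a_n$, the right-hand side is summable in $n$ as soon as $3-2b+2/\alpha < -1$, that is $b > 2+1/\alpha$, which is what the first Borel-Cantelli lemma requires for a fixed $z$.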

3.3. Distance from a row to a vector space. In this paragraph, we give two lower bounds on
the distance of a row of the random matrix $A - z$ defined by (1.4) to a vector space of not too
large dimension. The first ingredient is an adaptation of Proposition 5.1 in Tao and Vu [50].

Proposition 3.3 (Distance of a row to a subspace). Assume that (H1) holds. Let $0 < \gamma < 1/2$, and
let $R$ be a row of $a_n(A-z)$. There exists $\delta > 0$ depending on $\alpha,\gamma$ such that for all $d$-dimensional
subspaces $W$ of $\mathbb{C}^n$ with $n-d\ge n^{1-\gamma}$, one has

$$ \mathbb{P}\big(\mathrm{dist}(R,W)\le n^{(1-2\gamma)/\alpha}\big) \le e^{-n^{\delta}}. $$



The proof of Proposition 3.3 is based on a concentration estimate for the truncated variables
$X_{1i}\mathbf{1}_{\{|X_{1i}|\le b_n\}}$ for suitable sequences $b_n$. We first recall a concentration inequality of Talagrand.

Theorem 3.4 (Talagrand concentration inequality [46] and [34, Corollary 4.10]). Let us denote
by $\mathbb{D} := \{z\in\mathbb{C};\ |z|\le 1\}$ the complex unit disc and let $\mathbb{P}$ be a product probability measure on the
product space $\mathbb{D}^n$. Let $F:\mathbb{D}^n\to\mathbb{R}$ be a Lipschitz convex function on $\mathbb{D}^n$ with $\|F\|_{\mathrm{Lip}}\le 1$. If
$M(F)$ is a median of $F$ under $\mathbb{P}$ then for every $r\ge 0$,

P{\F-M{F)\ > r) 4e^'''/^ 

Proof of Proposition 3.3. We first perform some pre-processing of the vector $R$ as in Tao-Vu [50].
To fix ideas, we may assume that $R$ is the first row of $a_n(A-z)$. Then $R = X_1 - z\,a_n e_1$ where $X_1$
is the first row of $X = a_nA$. We then have

$$ \mathrm{dist}(R,W) \ge \mathrm{dist}\big(X_1 - z\,a_n e_1,\ \mathrm{span}(W,e_1)\big) = \mathrm{dist}(X_1,W_1), $$

where we have set $W_1 = \mathrm{span}(W,e_1)$. Note that $d\le\dim W_1\le d+1$.
For any sequence $b_n$, from the Markov inequality,

/ „ \ 



^l + eL(6„)6-")' 



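Choose $b_n = a_n n^{-2\gamma/\alpha}$. Clearly, $b_n/n^{(1-2\gamma)/\alpha}\in[n^{-\varepsilon},n^{\varepsilon}]$ eventually for all $\varepsilon > 0$.

Let $J$ denote the set of indexes $i$ such that $|X_{1i}|\le b_n$. From (3.4) we see that, for some $\delta > 0$:

$$ \mathbb{P}\big(|J| < n-\sqrt{n}\big) \le e^{-n^{\delta}}. $$

It follows that it is sufficient to prove the statement conditioned on the event $\{|J|\ge n-\sqrt{n}\}$. In
particular, we shall prove that for any fixed $I\subset\{1,\dots,n\}$, such that $|I|\ge n-\sqrt{n}$,

$$ \mathbb{P}\big(\mathrm{dist}(X_1,W_1)\le n^{(1-2\gamma)/\alpha}\ \big|\ J = I\big) \le e^{-n^{\delta}}. \tag{3.5} $$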
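Without loss of generality, we assume that $I = \{1,\dots,n'\}$ with $n'\ge n-\sqrt{n}$. Let $\pi_I$ be the
orthogonal projection on $\mathrm{span}(e_i : i\in I)$. If $W_2 = \pi_I(W_1)$, we find $d-\sqrt{n}\le\dim(W_2)\le
\dim(W_1)\le d+1$ and

$$ \mathrm{dist}(X_1,W_1) \ge \mathrm{dist}(\pi_I(X_1),W_2). $$

Note that $\pi_I(X_1)$ is simply the vector $X_{1i}$, $i = 1,\dots,n'$. We set

$$ W' = \mathrm{span}\big(W_2,\ \mathbb{E}[\pi_I(X_1)\,|\,J = I]\big), \qquad Y = \pi_I(X_1) - \mathbb{E}[\pi_I(X_1)\,|\,J = I], $$

so that $d-\sqrt{n}\le\dim(W')\le d+2$ and

$$ \mathrm{dist}(\pi_I(X_1),W_2) \ge \mathrm{dist}(Y,W'). $$

Let $P$ denote the orthogonal projection matrix to the orthogonal complement of $W'$ in $\mathbb{C}^{n'}$. We have
$\mathrm{dist}^2(Y,W') = \sum_{i,j}\overline{Y_i}P_{ij}Y_j$, and, since $Y = (Y_i)_{1\le i\le n'}$ is a mean zero vector under $\mathbb{P}(\cdot\,|\,J = I)$,

$$ \mathbb{E}\big[\mathrm{dist}^2(Y,W')\,\big|\,J = I\big] = \mathbb{E}\Big[\sum_{i,j}\overline{Y_i}P_{ij}Y_j\,\Big|\,J = I\Big] = \sum_{i=1}^{n'}P_{ii}\,\mathbb{E}\big[|Y_i|^2\,\big|\,J = I\big] = \mathbb{E}\big[|Y_1|^2\,\big|\,J = I\big]\,\mathrm{tr}P. $$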

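We have for any $\varepsilon > 0$ and for $n\gg 1$:

$$ \mathbb{E}\big[|Y_1|^2\,\big|\,J = I\big] = \mathbb{E}\big[|X_{11}|^2\,\big|\,J = I\big] - \big|\mathbb{E}[X_{11}\,|\,J = I]\big|^2 \ge 2c\,L(b_n)\,b_n^{2-\alpha} $$

for some constant $c > 0$, where the last bound follows from Lemma C.1 since by independence one has

$$ \mathbb{E}\big[|X_{11}|^2\,\big|\,J = I\big] = \mathbb{E}\big[|X_{11}|^2\,\big|\,|X_{11}|\le b_n\big], $$

and $|\mathbb{E}[X_{11}\,|\,J = I]|^2 = |\mathbb{E}[X_{11}\,|\,|X_{11}|\le b_n]|^2$ is $O(1)$ if $\alpha > 1$, while (by Lemma C.1) it is
$O(b_n^{2-2\alpha+\varepsilon})$ for $\varepsilon > 0$, if $\alpha\in(0,1]$.

Using $\mathrm{tr}P = n'-\dim(W')\ge\frac12(n-d)$, it follows that, for any $\varepsilon > 0$, for $n\gg 1$:

$$ \mathbb{E}\big[\mathrm{dist}^2(Y,W')\,\big|\,J = I\big] \ge c\,L(b_n)\,b_n^{2-\alpha}(n-d) \ge n^{q(\varepsilon)}, \tag{3.6} $$

where $q(\varepsilon) := (1-2\gamma)\frac{2}{\alpha}+\gamma-\varepsilon$.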

Under $\mathbb{P}(\cdot\,|\,J = I)$, the vector $(Y_1/b_n,\dots,Y_{n'}/b_n)$ is a vector of independent variables on $\mathbb{D}^{n'}$,
where $\mathbb{D}$ is the complex unit disc. We consider the function $F : x\mapsto\mathrm{dist}(x,W')$. The mapping $F$
is 1-Lipschitz and convex. From Theorem 3.4 we deduce that

2 

P(|dist(y, W') - M(dist(r, W'))\ '^r\J = 1)^ 4e"^ (3.7) 

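where $M(\mathrm{dist}(Y,W'))$ is a median of $\mathrm{dist}(Y,W')$ under $\mathbb{P}(\cdot\,|\,J = I)$.

It follows that, for e.g. $\delta = \gamma/2$, taking $\varepsilon = \gamma/4$ in (3.6), we obtain $q(\varepsilon) = (1-2\gamma)\frac{2}{\alpha}+\delta+\varepsilon$,
and therefore there exists $c > 0$ such that for $n\gg 1$,

$$ b_n^{-2}\,\mathbb{E}\big[\mathrm{dist}^2(Y,W')\,\big|\,J = I\big] \ge c\,n^{\delta}. \tag{3.8} $$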





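From (3.7) it follows that

$$ \mathbb{E}\big[\,|M(\mathrm{dist}(Y,W')) - \mathrm{dist}(Y,W')|^2\ \big|\ J = I\big] = O(b_n^2). $$

From the Cauchy-Schwarz inequality we then have

$$ \Big(M(\mathrm{dist}(Y,W')) - \sqrt{\mathbb{E}\big[\mathrm{dist}^2(Y,W')\,\big|\,J = I\big]}\Big)^2 \le \mathbb{E}\big[\,|M(\mathrm{dist}(Y,W')) - \mathrm{dist}(Y,W')|^2\ \big|\ J = I\big] = O(b_n^2). $$

The above estimates, with (3.6) and (3.8), imply that $M(\mathrm{dist}(Y,W'))\ge\frac12 n^{q(\varepsilon)/2}$ for $n\gg 1$.

Therefore, for $n\gg 1$,

$$ \mathbb{P}\big(\mathrm{dist}(Y,W')\le\tfrac14 n^{q(\varepsilon)/2}\ \big|\ J = I\big) \le \mathbb{P}\big(|M(\mathrm{dist}(Y,W')) - \mathrm{dist}(Y,W')|\ge\tfrac14 n^{q(\varepsilon)/2}\ \big|\ J = I\big). $$

The desired conclusion (3.5) now follows from (3.7) and (3.8). □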
So far we have shown that under assumption (H1), the distance of a row to a space with
codimension $n-d\ge n^{1-\gamma}$ is at least $n^{(1-2\gamma)/\alpha}$ with large probability. We want a sharper estimate,
namely at the order $n^{1/\alpha}$. We will obtain such a bound in a weak sense in the forthcoming
Proposition 3.7. Furthermore, we shall require assumption (H2) to do so. We start with some
preliminary facts.

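Below we write $Z = Z^{(\beta)}$, $\beta\in(0,1)$, for the one-sided $\beta$-stable distribution such that for all
$s\ge 0$,

$$ \mathbb{E}\exp(-sZ) = \exp(-s^{\beta}). $$

From the standard inversion formula, for $m > 0$,

$$ y^{-m} = \Gamma(m)^{-1}\int_0^\infty x^{m-1}e^{-xy}\,dx, $$

we see that all moments

$$ \mathbb{E}[Z^{-m}] = \Gamma(m)^{-1}\int_0^\infty x^{m-1}e^{-x^{\beta}}\,dx \tag{3.9} $$

are finite for $m > 0$. Also, recall that if $(Z_i)_{1\le i\le n}$ is an i.i.d. vector with distribution $Z$ then, for
every $(w_i)_{1\le i\le n}\in\mathbb{R}_+^n$, in distribution

$$ \sum_{i=1}^n w_iZ_i \overset{d}{=} \Big(\sum_{i=1}^n w_i^{\beta}\Big)^{1/\beta}Z_1. \tag{3.10} $$

Indeed, (3.10) follows from $\mathbb{E}\exp(-s\sum_i w_iZ_i) = \exp(-s^{\beta}\sum_i w_i^{\beta})$ and a change of variables.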
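
The change of variables behind (3.10) can be written out in one line. This is a sketch for nonnegative weights; the shorthand $c_w$ is introduced here and is not in the original.

```latex
\begin{align*}
\mathbb{E}\exp\Big(-s\sum_{i=1}^n w_iZ_i\Big)
 &= \prod_{i=1}^n\mathbb{E}\exp(-s\,w_iZ_i)
  = \prod_{i=1}^n\exp\big(-(s\,w_i)^{\beta}\big)\\
 &= \exp\big(-(s\,c_w)^{\beta}\big)
  = \mathbb{E}\exp(-s\,c_wZ_1),
 \qquad c_w := \Big(\sum_{i=1}^n w_i^{\beta}\Big)^{1/\beta}.
\end{align*}
```

Since the Laplace transform characterizes the law of a nonnegative random variable, the two sides of (3.10) have the same distribution.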

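Lemma 3.5. Assume (H2). There exists $\varepsilon > 0$ and $p\in(0,1)$ such that the random variable $|X_{11}|^2$
dominates stochastically the random variable $\varepsilon DZ$, where $\mathbb{P}(D = 1) = 1-\mathbb{P}(D = 0) = p$ is a
random variable with law Be$(p)$, $Z = Z^{(\beta)}$ with $\beta = \frac{\alpha}{2}$, and $D$ and $Z$ are independent.

Proof. From our assumptions, there exist $\delta > 0$ and $x_0 > 0$ such that

$$ \mathbb{P}(|X_{11}|^2 > x) \ge \delta x^{-\beta} \ge \mathbb{P}(\delta^{1/\beta}Z > x), \qquad x > x_0. $$

Let $p$ be the probability that $|X_{11}|^2 > x_0$. If $x > x_0$ then $\mathbb{P}(|X_{11}|^2 > x)\ge p\,\mathbb{P}(\delta^{1/\beta}Z > x) =
\mathbb{P}(\delta^{1/\beta}DZ > x)$. On the other hand, if $x\le x_0$ then $\mathbb{P}(|X_{11}|^2 > x)\ge p\ge\mathbb{P}(\delta^{1/\beta}DZ > x)$. In any
case, setting $\varepsilon = \delta^{1/\beta}$ we have

$$ \mathbb{P}(|X_{11}|^2 > x) \ge \mathbb{P}(\varepsilon DZ > x), \qquad x > 0. $$

This implies the lemma. □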

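Lemma 3.6. Assume (H2). Let $\omega_i\in[0,1]$ be numbers such that $\omega(n) := \sum_{i=1}^n\omega_i\ge n^{\frac12+\varepsilon}$ for
some $\varepsilon > 0$. Let $X_1 = (X_{1i})_{1\le i\le n}$ be i.i.d. random variables distributed as $X_{11}$, and let $Z = Z^{(\beta)}$
with $\beta = \frac{\alpha}{2}$. There exist $\delta > 0$ and a coupling of $X_1$ and $Z$ such that

$$ \mathbb{P}\Big(\sum_{i=1}^n\omega_i|X_{1i}|^2\le\delta\,\omega(n)^{1/\beta}Z\Big) \le e^{-n^{\delta}}. \tag{3.11} $$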

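Proof. Let $D = (D_i)_{1\le i\le n}$ denote an i.i.d. vector of Bernoulli variables with parameter $p$ given by
Lemma 3.5. From this latter lemma and (3.10) we know that there exist $\varepsilon > 0$ and a coupling of
$X_1$, $D$ and $Z$ such that

$$ \mathbb{P}\Big(\sum_{i=1}^n\omega_i|X_{1i}|^2\ge\varepsilon\Big(\sum_{i=1}^n\omega_i^{\beta}D_i\Big)^{1/\beta}Z\Big) = 1. $$

It remains to show that for some $\varepsilon' > 0$:

$$ \mathbb{P}\Big(\sum_{i=1}^n\omega_i^{\beta}D_i\le\varepsilon'\omega(n)\Big) \le e^{-n^{\delta}}. $$

Observe that $\omega_i^{\beta}\ge\omega_i$, so that $\mathbb{E}\sum_{i=1}^n\omega_i^{\beta}D_i\ge p\,\omega(n)$. Therefore, for $0 < \varepsilon' < p$,

$$ \mathbb{P}\Big(\sum_{i=1}^n\omega_i^{\beta}D_i\le\varepsilon'\omega(n)\Big) \le \mathbb{P}\Big(\sum_{i=1}^n\omega_i^{\beta}(D_i-p)\le-(p-\varepsilon')\,\omega(n)\Big) \le \exp\big(-2(p-\varepsilon')^2\omega(n)^2/n\big), $$

where we have used the Hoeffding inequality in the last bound. Since $\omega(n)\ge n^{\frac12+\varepsilon}$, this implies
the lemma. □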
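
The Hoeffding step can be sanity-checked numerically in the simplest case. The sketch below assumes unit weights ($\omega_i = 1$, so that $\omega(n) = n$), which is a simplification of the lemma's setting, and compares the exact lower tail of a binomial variable with the Hoeffding bound $\exp(-2(p-\varepsilon')^2 n)$. The function names are hypothetical.

```python
import math


def binomial_lower_tail(n: int, p: float, m: int) -> float:
    """Exact value of P(D_1 + ... + D_n <= m) for i.i.d. Bernoulli(p) variables D_i."""
    return sum(math.comb(n, k) * p ** k * (1 - p) ** (n - k) for k in range(m + 1))


def hoeffding_lower_tail(n: int, p: float, eps: float) -> float:
    """Hoeffding upper bound on P(D_1 + ... + D_n <= eps * n), valid for 0 < eps < p."""
    if not 0 < eps < p:
        raise ValueError("need 0 < eps < p")
    return math.exp(-2.0 * (p - eps) ** 2 * n)


def compare(n: int, p: float, eps: float) -> tuple:
    """Return (exact tail, Hoeffding bound) for the event {sum <= eps * n}."""
    exact = binomial_lower_tail(n, p, math.floor(eps * n))
    return exact, hoeffding_lower_tail(n, p, eps)
```

Calling `compare(n, 0.5, 0.25)` for increasing `n` shows both quantities decaying exponentially in `n`, with the exact tail below the bound; in the lemma the same mechanism yields a bound of order $\exp(-c\,n^{2\varepsilon})$.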

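Proposition 3.7. Assume (H2) and take $0 < \gamma\le\alpha/4$. Let $R$ be the first row of the matrix
$a_n(A-z)$. There exists a constant $c > 0$ and an event $E$ such that for any $d$-dimensional subspace
$W$ of $\mathbb{C}^n$ with codimension $n-d\ge n^{1-\gamma}$, we have

$$ \mathbb{E}\big[\mathrm{dist}^{-2}(R,W);\ E\big] \le c\,(n-d)^{-2/\alpha} \quad\text{and}\quad \mathbb{P}(E^c) \le c\,n^{-(1-2\gamma)/\alpha}. $$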

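Proof. As in the proof of Proposition 3.3 we have

$$ \mathrm{dist}(R,W) \ge \mathrm{dist}(X_1,W_1), $$

where $W_1 = \mathrm{span}(W,e_1)$, $d\le\dim W_1\le d+1$, and $X_1 = (X_{1i})_{1\le i\le n}$ is the first row of $X = a_nA$.
Let $\mathcal{I}$ denote the set of indexes $i$ such that $|X_{1i}|\le a_n$. From (3.4) we know that

$$ \mathbb{P}\big(|\mathcal{I}| < n-\sqrt{n}\big) \le e^{-n^{\delta}}, $$

for some $\delta > 0$. It is thus sufficient to prove that for any set $I\subset\{1,\dots,n\}$ such that $|I|\ge n-\sqrt{n}$,

$$ \mathbb{E}\big[\mathrm{dist}^{-2}(R,W);\ E_I\ \big|\ \mathcal{I} = I\big] \le c\,(n-d)^{-2/\alpha}, $$

for some event $E_I$ satisfying $\mathbb{P}((E_I)^c\,|\,\mathcal{I} = I)\le n^{-(1-2\gamma)/\alpha}$. We will then simply set

$$ E = E_{\mathcal{I}}\cap\{|\mathcal{I}|\ge n-\sqrt{n}\}. $$

Without loss of generality, we assume that $I = \{1,\dots,n'\}$ with $n'\ge n-\sqrt{n}$. Let $\pi_I$ be the
orthogonal projection on $\mathrm{span}(e_i : i\in I)$. If $W_2 = \pi_I(W_1)$, set

$$ W' = \mathrm{span}\big(W_2,\ \mathbb{E}(\pi_I(X_1)\,|\,\mathcal{I} = I)\big). $$

Note that $d-\sqrt{n}\le\dim(W')\le\dim(W_1)+1\le d+2$. Defining

$$ Y = \pi_I(X_1) - \mathbb{E}(\pi_I(X_1)\,|\,\mathcal{I} = I), $$

we have

$$ \mathrm{dist}(R,W) \ge \mathrm{dist}(X_1,W_1) \ge \mathrm{dist}(Y,W'). $$

Thus, $Y = (Y_i)_{1\le i\le n'}$ is an i.i.d. mean zero vector under $\mathbb{P}(\cdot\,|\,\mathcal{I} = I)$. Let $P$ denote the orthogonal
projection matrix to the orthogonal of $W'$ in $\mathbb{C}^{n'}$. By construction, we have

$$ \mathbb{E}\big(\mathrm{dist}^2(Y,W')\,\big|\,\mathcal{I} = I\big) = \mathbb{E}\Big(\sum_{i,j=1}^{n'}\overline{Y_i}P_{ij}Y_j\,\Big|\,\mathcal{I} = I\Big) = \mathbb{E}\big[|Y_1|^2\,\big|\,\mathcal{I} = I\big]\,\mathrm{tr}P. $$

Here $\mathrm{tr}P = \sum_{i=1}^{n'}P_{ii}$ where $P_{ii} = (e_i,Pe_i)\in[0,1]$ and $\mathrm{tr}P = n'-\dim(W')$ satisfies

$$ 2(n-d) \ge \mathrm{tr}P \ge \tfrac12(n-d). \tag{3.12} $$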

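Let $S = \sum_{i=1}^{n'}P_{ii}|Y_i|^2$. We have

$$ \begin{aligned} \mathbb{E}\big((\mathrm{dist}^2(Y,W')-S)^2\,\big|\,\mathcal{I} = I\big) &= \mathbb{E}\Big[\Big(\sum_{i\ne j}\overline{Y_i}P_{ij}Y_j\Big)^2\,\Big|\,\mathcal{I} = I\Big] \\ &= \sum_{i_1\ne j_1,\ i_2\ne j_2}P_{i_1j_1}P_{i_2j_2}\,\mathbb{E}\big(\overline{Y_{i_1}}Y_{j_1}\overline{Y_{i_2}}Y_{j_2}\,\big|\,\mathcal{I} = I\big) \\ &\le 2\,\mathbb{E}\big[|Y_1|^2\,\big|\,\mathcal{I} = I\big]^2\,\mathrm{tr}P^2. \end{aligned} $$

Note that,

$$ \mathbb{E}\big[|Y_1|^2\,\big|\,\mathcal{I} = I\big] \le \mathbb{E}\big[|X_{11}|^2\,\big|\,\mathcal{I} = I\big] = \mathbb{E}\big[|X_{11}|^2\,\big|\,|X_{11}|\le a_n\big] = \frac{\mathbb{E}\big[|X_{11}|^2\mathbf{1}_{\{|X_{11}|\le a_n\}}\big]}{\mathbb{P}(|X_{11}|\le a_n)} = O\Big(\frac{a_n^2}{n}\Big), $$

where the last bound follows from Lemma C.1. Since $P^2 = P$, we deduce that

$$ \mathbb{E}\big[(\mathrm{dist}^2(Y,W')-S)^2\,\big|\,\mathcal{I} = I\big] = O\Big(\frac{a_n^4\,(n-d)}{n^2}\Big). \tag{3.13} $$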

Next, let $Z = Z^{(\beta)}$ with $\beta = \frac{\alpha}{2}$, as in Lemma 3.6. Set $\omega_i = P_{ii}$, $i = 1,\dots,n'$, and for $\varepsilon > 0$, consider
the event

From lemma [22] (with n replaced by n' ^ n — n^/^) and using p. 121) there exists a coupling of the 
vector Xii, i — 1, . . . ,n' and Z such that 

P(rj) s; e""' , (3.14) 

for some S > and some choice of e > 0. Also, since (a — b)'^ ^ — for all a, 6 e R, we have 
S ^ ^ Sa — Sb, where 

n' n' 
1=1 i=l 

From Lemma IC.ll and p.l2p we have 

5b=E[|Xn|||XnKa„]'trP = /i(")(n,d) (3.15) 

where h^°'\n,d) - {n-d)al/n^ if a G (0,1] and h'-°'\n,d) - (n - d) if a e (1,2). Let G) be the 
event that Sa ^ i Sb- From p.lSp and the definition of F/ we have, for some cq > 

¥{{G}y n F/ |Z = /)==: P(Z ==: co(n - d)-i/^/i(")(n,d) |I = /) . 

Note that, thanks to the assumptions $n-d\ge n^{1-\gamma}$, $\gamma\le\alpha/4$, we have $(n-d)^{-2/\alpha}h^{(\alpha)}(n,d)\le n^{-\varepsilon_0}$
for some $\varepsilon_0 = \varepsilon_0(\alpha) > 0$ for all $\alpha\in(0,2)$, for $n\gg 1$. Therefore, for $n\gg 1$,

P((G})= n F/ 1 1 = /) P(Z «C Co 11 = /) 

^ P(Zg:con-^°; j:a„, V^ = l,...,nO 

P(|XHKa„, V^ = l,...,r^') 

where the last identity follows from the independence of the $X_{1i}$. Observing that the probability
for the event $\{|X_{1i}|\le a_n,\ \forall i = 1,\dots,n'\}$ is lower bounded by $1/c > 0$ uniformly in $n$, we obtain

$$ \mathbb{P}\big((G^1_I)^c\cap\Gamma_I\,\big|\,\mathcal{I} = I\big) \le c\,\mathbb{P}\big(Z\le c_0n^{-\varepsilon_0}\big). $$

The latter probability can be estimated using Markov's inequality and the fact that $\mathbb{E}[Z^{-m}] = u_m$
is finite (cf. (3.9)). Indeed, for every $m > 0$, $\mathbb{P}(Z\le t)\le u_mt^m$. Thus, we have shown that for
every $p > 0$ there exists a constant $\kappa_p$ such that

$$ \mathbb{P}\big((G^1_I)^c\cap\Gamma_I\,\big|\,\mathcal{I} = I\big) \le \kappa_p\,n^{-p}. \tag{3.16} $$

Next, we set F/ = G} n F/ and wc claim that 



E 



S-^-Ti\X^I 



0({n-d)-^'"), (3.17) 



Indeed, on f / we have 5 > ^ 6*0 ^ |(n — d)^/" Z and therefore, for some constant Ci, 

5^2; fj|2: = /j ^ci(n-d)-*/"E[Z-2|i = /] . 

Using independence as before, and recalling that the event ^ a„ , Vi = l,...,n'} has 

uniformly positive probability we have 

E \X = I\ ^ cE[Z"2] ^ _ 
This proves (3.17).

Now, for the event Markov's and Cauchy-Schwarz' inequahties lead to 

f\dist^{Y,W')- S\ 



dist^{Y,W') S/2;Ti\I = I) 



^ 2E 



\dist^{Y,W')- S\ 
S 



^ 1/2; r/|Z = / 



s; 2WE[|dist2(r,W^')-'5'|2|I = /]E 



5-2; rHi = / 



Hence, if Gf denotes the event {dist^(r, W) ^ 5/2}, we deduce from ((3l3t and (|3Tf)) 



(3.18) 



Note that, using n — d ^ , the last expression is certainly 0{n » ^7))^ Qj^ the other hand, 
by p.l7p and Cauchy-Schwarz' inequality 



E 



dist"2(X,l^); G?nr/|i = / s;2E s-^;ri\i = i =o((n-d)-2/"^ 



(3.19) 



To conclude the proof we take Ej = Gj nfj = G] Gj nTj. We have 

P{{Eiy\I^I) i^F {{Tiy \I ^ I) + P {{G})'' nVi \I ^ I) + P {{Gjy n G] DTi \I = I) . 
From ((XTBI and (|XTS)) we see that, 

p((G})'=nr/|i = /) +p((G?)'=nG}nr/|i = /) = o (n-i (1-^7) j ^ 

and all it remains to prove is an upper bound on P {{TjY \ X = I). By independence, as before 

P((r/)^|I = /) s;cP((r/)^ |Xh| s; a„ , Vz = 1, . . . ,n') . 

From ((XTi|) we obtain P ((r/)= 1 1 /) ce""'. This ends the proof. □ 

3.4. Uniform integrability. Let $z\in\mathbb{C}$ and $\sigma_n\le\dots\le\sigma_1$ be the singular values of $A_n-z$ with
$A_n$ defined by (1.4). For $0 < \delta < 1$, we define $K_\delta = [\delta,\delta^{-1}]$. In this paragraph, we prove the
uniform integrability in probability, meaning that for all $\varepsilon > 0$, there exists $\delta > 0$ such that



I \n{x)\vA^-z{dx) > £ 



0. 



(3.20) 



From lemma [5TT1 with probability 1 there exists cq > 0, such that for all n, 

\ii'^{x)iyA„^z{dx) < Cq. 



It follows from Markov inequality that for all t ^ 1, \n{x)i^A„-z{dx) < cq/ \nt. The upper part 
(5-^,00) of p. 201) is thus not an issue. For the lower part (0,(5), it is sufficient to prove that 

^ n—l 



j=0 



converges in probability to for any sequence ((5„)„ converging to 0. From lemma [3?^ we may a.s. 
lower bound an-i by cn~^ for some constant c and all integer n ^ 1. Take < 7 < a/4 to be fixed 
later. Using this latter bound for every 1 ^ i < n^^'' , it follows that it is sufficient to prove that 

^ n— 1 
i=[n'- — 'J 

converges in probability to 0. We are going to prove that there exists an event F„ such that, for 
some S > and c > 0, 

P((i^„)^) ^cexp(-n*), (3.21) 

and 



E 



(3.22) 
We first conclude the proof before proving (3.21)-(3.22). From the Markov inequality and (3.22), we
deduce that



/r) \ - + 1 



1/C — +1) 

If follows that there exists a sequence Sn — Sn tending to such that the probability that 

F{an-[nE„] ^ ^n) converges to 0. We obtain that it is sufficient to prove that 

i=Lni-Tj 

given Fn converges in probability to 0. However, using the concavity of the logarithm and p.22p 
we have 



E 



Le„nJ 



= ci (-e„lne„ + £„ + 0(n ^)) . 

It thus remains to prove (3.21)-(3.22). Let $B_n$ be the matrix formed by the first $n-\lfloor i/2\rfloor$ rows of
$a_n(A_n-zI)$. If $\sigma'_1\ge\dots\ge\sigma'_{n-\lfloor i/2\rfloor}$ are the singular values of $B_n$, then by the Cauchy interlacing
Lemma B.4,


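By the Tao-Vu negative second moment Lemma B.3 we have

$$ \sum_{j=1}^{n-\lfloor i/2\rfloor}\sigma_j'^{-2} = \sum_{j=1}^{n-\lfloor i/2\rfloor}\mathrm{dist}_j^{-2}, $$

where $\mathrm{dist}_j$ is the distance from the $j$-th row of $B_n$ to the subspace spanned by the other rows of
$B_n$. In particular,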

n-\i/2\ 
% 2 2 \ ^ T 2 

2^n~r «^ a« 2^ distj . 

Let $F_n$ be the event that for all $1\le j\le n-\lfloor i/2\rfloor$, $\mathrm{dist}_j\ge n^{(1-2\gamma)/\alpha}$. Since the dimension of the
span of all but one rows of $B_n$ is at most $d\le n-i/2$, we can use Proposition 3.3 to obtain

$$ \mathbb{P}\big((F_n)^c\big) \le \exp(-n^{\delta}), $$

for some $\delta > 0$. Then we write



n-\i/2\ 

\^nl^tF^^al dist-^i^^ 



Taking expectation, we get 



; Fn] ^ 2aln¥. [dist^' ; F„] 



(3.23) 



Since we are on $F_n$ we can always estimate $\mathrm{dist}_1\ge n^{(1-2\gamma)/\alpha}$. By introducing a further decomposition
we can strengthen this as follows. Recall that from Proposition 3.7 there exists an event
$E$ independent from the rows $j\ne 1$ such that $\mathbb{P}(E^c)\le n^{-(1-2\gamma)/\alpha}$ and for any $W\subset\mathbb{C}^n$ with
dimension $d < n - n^{1-\gamma}$ one has

$$ \mathbb{E}\big[\mathrm{dist}(R,W)^{-2};\ E\big] \le c\,(n-d)^{-2/\alpha}. $$

Here $R$ is the first row of the matrix $B_n$. By first conditioning on the value of the other rows of
$B_n$ and recalling that the dimension $d$ of the span of these is at most $n-i/2\le n-2n^{1-\gamma}$, we see
that

E[distr^ ; E] = fr^/" 
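Therefore

$$ \mathbb{E}\big[\mathrm{dist}_1^{-2};\ F_n\big] \le \mathbb{E}\big(\mathrm{dist}_1^{-2};\ E\big) + \mathbb{P}(E^c)\,n^{-2(1-2\gamma)/\alpha} \le c_2\big(i^{-2/\alpha} + n^{-3(1-2\gamma)/\alpha}\big). \tag{3.24} $$

Now, if $\gamma < 1/6$ we have $3(1-2\gamma)/\alpha > 2/\alpha$ and therefore $n^{-3(1-2\gamma)/\alpha}\le i^{-2/\alpha}$. Thus (3.24)
implies

$$ \mathbb{E}\big[\mathrm{dist}_1^{-2};\ F_n\big] \le 2c_2\,i^{-2/\alpha}. \tag{3.25} $$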

From p.23p we obtain 

E[ia,^^,;F„] < 2 ca n«-2/" . 
From (H2) it follows that (IX^ holds. This concludes the proof of 

3.5. Proof of Theorem 1.2. We may now invoke Theorem 1.1 and (3.20). From Lemma A.2, $\mu_{A_n}$
converges in probability to $\mu_\alpha$, where for almost all $z\in\mathbb{C}$,

$$ U_{\mu_\alpha}(z) = -\int\ln(x)\,\nu_{\alpha,z}(dx). $$

Let us upgrade this convergence to an a.s. convergence. By Lemmas 3.1 and A.1 it is sufficient
to prove that for every $z\in\mathbb{C}$, a.s.

$$ \lim_{n\to\infty}U_{\mu_{A_n}}(z) = U_{\mu_\alpha}(z). $$

Let us fix $z\in\mathbb{C}$ from now on. Since $L = U_{\mu_\alpha}(z)$ is deterministic, it actually suffices to show that
there exists a deterministic sequence $L_n$ such that a.s.

$$ \lim_{n\to\infty}\big(U_{\mu_{A_n}}(z) - L_n\big) = 0. \tag{3.26} $$

Now, by Lemmas 3.1 and 3.2 there exists $b > 0$ such that a.s. for $n\gg 1$,

$$ \mathrm{supp}(\nu_{A_n-zI}) \subset [s_n(A_n-zI),\,s_1(A_n-zI)] \subset [n^{-b},n^{b}]. $$

Denoting $f_n : x\in\mathbb{R}_+\mapsto f_n(x) = \mathbf{1}_{[n^{-b},n^{b}]}(x)\log(x)$, we get that a.s. for $n\gg 1$,

$$ U_{\mu_{A_n}}(z) = -\int_0^\infty\log(s)\,d\nu_{A_n-zI}(s) = -\int_0^\infty f_n(s)\,d\nu_{A_n-zI}(s). \tag{3.27} $$

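The total variation of $f_n$ is bounded by $c\log n$ for some $c > 0$. Hence by Lemma C.2, if

$$ L_n := \mathbb{E}\int_0^\infty f_n(s)\,d\nu_{A_n-zI}(s), $$

then we have, for every $\varepsilon > 0$,

$$ \mathbb{P}\Big(\Big|\int_0^\infty f_n(s)\,d\nu_{A_n-zI}(s) - L_n\Big|\ge\varepsilon\Big) \le 2\exp\Big(-2\,\frac{n\varepsilon^2}{(c\log n)^2}\Big). $$

In particular, from the first Borel-Cantelli lemma, a.s.,

$$ \lim_{n\to\infty}\Big(\int_0^\infty f_n(s)\,d\nu_{A_n-zI}(s) - L_n\Big) = 0. $$

Finally, using (3.27), we deduce that (3.26) holds almost surely, as required.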
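
The total variation bound used in this proof can be made explicit. This is a sketch for $n\ge 2$, with $f_n = \mathbf{1}_{[n^{-b},n^{b}]}\log$ on $\mathbb{R}_+$: the function vanishes outside the interval, jumps at both endpoints, and is increasing in between.

```latex
\|f_n\|_{\mathrm{TV}}
 = \underbrace{b\log n}_{\text{jump at } n^{-b}}
 + \underbrace{2b\log n}_{\text{increase on } [n^{-b},\,n^{b}]}
 + \underbrace{b\log n}_{\text{jump at } n^{b}}
 = 4b\log n .
```

In particular the constant $c = 4b$ works in the bound $c\log n$, and a tail bound of the form $2\exp(-C n\varepsilon^2/\log^2 n)$ is summable in $n$ for every fixed $\varepsilon > 0$.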

4. Limiting spectral measure 

In this section, we take a closer look at the resolvent of the random operator on the PWIT and
we deduce some properties of the limiting spectral measure $\mu_\alpha$. For ease of notation we set



^ 2 



and define the measure on 
4.1. Resolvent operator on the Poisson Weighted Infinite Tree. In this paragraph, we
analyze the random variable

$$ R(U)_{\varnothing\varnothing} = \begin{pmatrix} a(z,\eta) & b(z,\eta) \\ b'(z,\eta) & c(z,\eta) \end{pmatrix}. $$

By Lemma 2.2, for $t\in\mathbb{R}_+$, $a(z,it)$ is pure imaginary and we set

$$ h(z,t) = \mathfrak{Im}(a(z,it)) = -i\,a(z,it) \in [0,t^{-1}]. $$

The random variables $a(z,\eta)$ and $h(z,t)$ solve a nice recursive distribution equation.

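Theorem 4.1 (Recursive Distributional Equation). Let $U = U(z,\eta) \in \mathbb{H}_+$, $t \in \mathbb{R}_+$. Let $L_U$ be
the distribution on $\mathbb{C}_+$ of $a(z,\eta)$ and $L_{z,t}$ the distribution of $h(z,t)$.
(i) $L_U$ solves the equation in distribution

\[ a \overset{d}{=} \frac{\eta + \sum_{k\in\mathbb{N}} \xi_k a_k}{|z|^2 - \bigl(\eta + \sum_{k\in\mathbb{N}} \xi_k a_k\bigr)\bigl(\eta + \sum_{k\in\mathbb{N}} \zeta_k a'_k\bigr)} \tag{4.1} \]

where $a$, $(a_k)_{k\in\mathbb{N}}$ and $(a'_k)_{k\in\mathbb{N}}$ are i.i.d. with law $L_U$ independent of $\{\xi_k\}_{k\in\mathbb{N}}$, $\{\zeta_k\}_{k\in\mathbb{N}}$ two
independent Poisson point processes on $\mathbb{R}_+$ with intensity $\Lambda_\alpha$.
(ii) $L_{z,t}$ is the unique probability distribution on $[0,\infty)$ such that

\[ h \overset{d}{=} \frac{t + \sum_{k\in\mathbb{N}} \xi_k h_k}{|z|^2 + \bigl(t + \sum_{k\in\mathbb{N}} \xi_k h_k\bigr)\bigl(t + \sum_{k\in\mathbb{N}} \zeta_k h'_k\bigr)} \tag{4.2} \]

where $h$, $(h_k)_{k\in\mathbb{N}}$ and $(h'_k)_{k\in\mathbb{N}}$ are i.i.d. with law $L_{z,t}$, independent of $\{\xi_k\}_{k\in\mathbb{N}}$, $\{\zeta_k\}_{k\in\mathbb{N}}$
two independent Poisson point processes on $\mathbb{R}_+$ with intensity $\Lambda_\alpha$.
(iii) For $t = 0$ there are two probability distributions on $[0,\infty)$ solving (4.2) such that $\mathbb{E} h^{\alpha/2} < \infty$:
$\delta_0$ and another denoted by $L_{z,0}$. Moreover, for the topology of weak convergence, $L_{z,t}$
converges to $L_{z,0}$ as $t$ goes to 0.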
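A fixed point of a recursive distributional equation such as (4.2) can be explored numerically with a "population dynamics" iteration. This is a standard heuristic and is not used in the paper; the sketch below is only an illustration. It assumes the intensity $\Lambda_\alpha = \beta x^{-\beta-1}dx$ with $\beta = \alpha/2$, represents the Poisson points as $\Gamma_k^{-1/\beta}$ with $\Gamma_k$ the arrival times of a unit-rate Poisson process, and truncates each sum to `k_max` terms. All function names, the pool size, and the truncation level are hypothetical choices.

```python
import random

def poisson_points(beta, k_max, rng):
    """Largest k_max points of a Poisson process on (0, inf) with intensity
    beta * x^(-beta-1) dx, obtained as G_k^(-1/beta) from unit-rate arrivals G_k."""
    points, g = [], 0.0
    for _ in range(k_max):
        g += rng.expovariate(1.0)
        points.append(g ** (-1.0 / beta))
    return points

def rde_sample(pool, z_abs2, t, beta, k_max, rng):
    """One draw of the right-hand side of (4.2), with h_k and h'_k resampled
    uniformly from the current pool (truncated sums: an approximation)."""
    s1 = sum(x * rng.choice(pool) for x in poisson_points(beta, k_max, rng))
    s2 = sum(x * rng.choice(pool) for x in poisson_points(beta, k_max, rng))
    return (t + s1) / (z_abs2 + (t + s1) * (t + s2))

def population_dynamics(alpha, z_abs2, t, pool_size=200, sweeps=10, k_max=50, seed=0):
    """Iterate the map on an empirical pool approximating the law L_{z,t}."""
    rng = random.Random(seed)
    beta = alpha / 2.0
    pool = [1.0 / t] * pool_size  # start from the a priori upper bound 1/t
    for _ in range(sweeps):
        pool = [rde_sample(pool, z_abs2, t, beta, k_max, rng) for _ in range(pool_size)]
    return pool

pool = population_dynamics(alpha=1.0, z_abs2=1.0, t=0.5)
print("min:", min(pool), "max:", max(pool))
```

Every sample stays in $(0, 1/t]$, in line with the bound on $h(z,t)$ stated before the theorem; the pool is only a rough proxy for $L_{z,t}$ because of the truncation and the finite pool size.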



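We start with an important lemma.

Lemma 4.2. For every $U = U(z,\eta) \in \mathbb{H}_+$, $\begin{pmatrix} a & b \\ b' & c \end{pmatrix}$ is equal in distribution to

\[ \frac{1}{|z|^2 - \bigl(\eta + \sum_{k\in\mathbb{N}} \xi_k a_k\bigr)\bigl(\eta + \sum_{k\in\mathbb{N}} \zeta_k a'_k\bigr)} \begin{pmatrix} \eta + \sum_{k\in\mathbb{N}} \xi_k a_k & -z \\ -\bar z & \eta + \sum_{k\in\mathbb{N}} \zeta_k a'_k \end{pmatrix} \tag{4.3} \]

where $a$, $(a_k)_{k\in\mathbb{N}}$ and $(a'_k)_{k\in\mathbb{N}}$ are i.i.d. with law $L_U$ independent of $\{\xi_k\}_{k\in\mathbb{N}}$, $\{\zeta_k\}_{k\in\mathbb{N}}$ two
independent Poisson point processes on $\mathbb{R}_+$ with intensity $\Lambda_\alpha$.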



Proof of lemma \4.2\ Consider a realization of PWIT(2^e) on the tree T. For fc £ N, we define 
Tk as the subtree of T spanned by kN^ . With the notation of lemma [2751 for k E N, RskiU) = 
{Bk{z) — ri)~-^ is the resolvent operator of Bk and set 

R{u)kk = nkRBAu)K = (l' 



Cfc 



Then, by lemma 1^31 and (|2.13p . we get 
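\[ R(U)_{\varnothing\varnothing} = -\left( U + \begin{pmatrix} \sum_{k\in\mathbb{N}} (1-\varepsilon_k)|y_k|^{-2/\alpha} c_k & 0 \\ 0 & \sum_{k\in\mathbb{N}} \varepsilon_k |y_k|^{-2/\alpha} a_k \end{pmatrix} \right)^{-1} = \frac{1}{D} \begin{pmatrix} \eta + \sum_{k\in\mathbb{N}} \varepsilon_k |y_k|^{-2/\alpha} a_k & -z \\ -\bar z & \eta + \sum_{k\in\mathbb{N}} (1-\varepsilon_k)|y_k|^{-2/\alpha} c_k \end{pmatrix}, \]

with $D := |z|^2 - \bigl(\eta + \sum_{k\in\mathbb{N}} \varepsilon_k |y_k|^{-2/\alpha} a_k\bigr)\bigl(\eta + \sum_{k\in\mathbb{N}} (1-\varepsilon_k)|y_k|^{-2/\alpha} c_k\bigr)$.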


Now the structure of the PWIT implies that (i) $a_k$ and $c_k$ have common distribution $L_U$; and
(ii) the variables $(a_k, c_k)_{k\in\mathbb{N}}$ are i.i.d. Also the thinning property of Poisson processes implies
that (iii) $\{\varepsilon_k |y_k|^{-2/\alpha}\}_{k\in\mathbb{N}}$ and $\{(1-\varepsilon_k)|y_k|^{-2/\alpha}\}_{k\in\mathbb{N}}$ are independent Poisson point processes with
common intensity $\Lambda_\alpha$. □

The next well-known and beautiful lemma will be crucial in the computations that will follow.
It is a consequence of the LePage-Woodroofe-Zinn representation of stable laws [35^: see also 
Panchcnko and Talagrand [HI Lemma 2.1]. 

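Lemma 4.3. Let $\{\xi_k\}_{k\in\mathbb{N}}$ be a Poisson process with intensity $\Lambda_\alpha$. If $(Y_k)$ is an i.i.d. sequence of
non-negative random variables, independent of $\{\xi_k\}_{k\in\mathbb{N}}$, such that $\mathbb{E}[Y_1^\beta] < \infty$ then

\[ \sum_{k\in\mathbb{N}} \xi_k Y_k \overset{d}{=} \mathbb{E}[Y_1^\beta]^{1/\beta} \sum_{k\in\mathbb{N}} \xi_k \overset{d}{=} \mathbb{E}[Y_1^\beta]^{1/\beta} S \]

where $S$ is the positive $\beta$-stable random variable with Laplace transform for all $x \geq 0$,

\[ \mathbb{E} \exp(-xS) = \exp\bigl(-\Gamma(1-\beta)\, x^\beta\bigr). \tag{4.4} \]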
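Proof of lemma 4.3. Recall the formulas, for $y \geq 0$, $\eta > 0$ and $0 < \eta < 1$ respectively,

\[ y^{-\eta} = \Gamma(\eta)^{-1} \int_0^\infty x^{\eta-1} e^{-xy}\, dx \quad \text{and} \quad y^{\eta} = \Gamma(1-\eta)^{-1}\, \eta \int_0^\infty x^{-\eta-1} (1 - e^{-xy})\, dx. \tag{4.5} \]

From the Lévy-Khinchin formula we deduce that, with $s \geq 0$,

\[ \mathbb{E} \exp\Bigl(-s \sum_k \xi_k Y_k\Bigr) = \exp\Bigl( \mathbb{E} \int_0^\infty (e^{-xsY_1} - 1)\, \beta x^{-\beta-1}\, dx \Bigr) = \exp\bigl( -\Gamma(1-\beta)\, s^\beta\, \mathbb{E}[Y_1^\beta] \bigr). \]

□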

Proof of theorem 4.1. Statement (i) is contained in lemma 4.2. For (ii), let $t > 0$ and $h$ a solution
of (4.2). Then $h$ is positive and is upper bounded by $1/t$. By lemma 4.3 we may rewrite (4.2) as

\[ h \overset{d}{=} \frac{t + \mathbb{E}[h^\beta]^{1/\beta} S}{|z|^2 + \bigl(t + \mathbb{E}[h^\beta]^{1/\beta} S\bigr)\bigl(t + \mathbb{E}[h^\beta]^{1/\beta} S'\bigr)} \tag{4.6} \]

where $S$ and $S'$ are i.i.d. variables with common Laplace transform (4.4). In particular, $\mathbb{E}[h^\beta]^{1/\beta}$
is solution of the equation in $y$:

\[ y^\beta = \mathbb{E} \left( \frac{t + yS}{|z|^2 + (t + yS)(t + yS')} \right)^{\beta}. \]

Since $t > 0$, $\mathbb{E}[h^\beta] > 0$, it follows that $\mathbb{E}[h^\beta]^{1/\beta}$ is solution of the equation in $y$:

\[ 1 = \mathbb{E} \left( \frac{t y^{-1} + S}{|z|^2 + (t + yS)(t + yS')} \right)^{\beta}. \tag{4.7} \]

For every $S, S' > 0$, the function $y \mapsto \frac{t y^{-1} + S}{|z|^2 + (t + yS)(t + yS')}$ is decreasing in $y$. It follows that

\[ y \mapsto \mathbb{E} \left( \frac{t y^{-1} + S}{|z|^2 + (t + yS)(t + yS')} \right)^{\beta} \]

is decreasing in $y$. As $y$ goes to 0 it converges to $\infty$ and as $y$ goes to infinity, it converges to 0. In
particular, there is a unique point $y_*(|z|^2, t)$ in $(0,\infty)$ such that (4.7) holds. This proves (ii) since from
(4.6), the law of $h$ is determined by $\mathbb{E}[h^\beta]^{1/\beta} = y_*(|z|^2, t)$.

For Statement (iii) and $t = 0$, $h = 0$ is a particular solution of (4.2). If $h$ is not a.s. equal
to 0, then $\mathbb{E}[h^\beta]^{1/\beta} > 0$ and the argument above still works since, for every $s, s' > 0$, the function
$y \mapsto \frac{s}{|z|^2 + y^2 s s'}$ is decreasing in $y$. We deduce the existence of a unique positive solution $y_*(|z|^2, 0)$
of (4.7). We also have the continuity of the function $t \mapsto y_*(|z|^2, t)$ on $[0, \infty)$. Finally

\[ h \overset{d}{=} \frac{y_*(|z|^2, 0)\, S}{|z|^2 + y_*^2(|z|^2, 0)\, S S'}, \]

and from (4.6), it implies the weak convergence of $L_{z,t}$ to $L_{z,0}$. □

4.2. Density of the limiting measure. In this paragraph, we analyze the RDE (4.3). For all
$t > 0$, let $L_{z,t}$ be as in theorem 4.1. From Equation (4.6), $h$ may be expressed as

\[ h \overset{d}{=} \frac{t + y_* S}{|z|^2 + (t + y_* S)(t + y_* S')} \]

where $S$ and $S'$ are i.i.d. variables with common Laplace transform (4.4) and $y_* := y_*(|z|^2, t)$ is the
unique solution in $(0,\infty)$ of (4.7) (uniqueness is proved in theorem 4.1). We extend continuously
the function $y_*(r,t)$ for $t = 0$ by defining $y_*(|z|^2, 0)$ as the unique solution in $(0,\infty)$ of

\[ 1 = \mathbb{E} \left( \frac{S}{|z|^2 + y^2 S S'} \right)^{\beta}. \tag{4.8} \]

Lemma 4.4. The function $y_* : [0, \infty)^2 \to (0, \infty)$ is $C^1$. For every $t \geq 0$, the mapping $r \mapsto y_*(r, t)$
is decreasing to 0.

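Proof. For every $t \geq 0$, the derivative in $y > 0$ of the function $\mathbb{E} \left( \frac{t y^{-1} + S}{|z|^2 + (t + yS)(t + yS')} \right)^{\beta}$ is

\[ -\beta t y^{-2}\, \mathbb{E}\, \frac{(t y^{-1} + S)^{\beta-1}}{\bigl(|z|^2 + (t + yS)(t + yS')\bigr)^{\beta}} \;-\; \beta\, \mathbb{E}\, \frac{(t y^{-1} + S)^{\beta} \bigl(S(t + yS') + S'(t + yS)\bigr)}{\bigl(|z|^2 + (t + yS)(t + yS')\bigr)^{\beta+1}}. \tag{4.9} \]

The last computation is justified since all terms are integrable, indeed we have

\[ \frac{(t y^{-1} + S)^{\beta-1}}{\bigl(|z|^2 + (t + yS)(t + yS')\bigr)^{\beta}} \leq \frac{y^{-\beta+1}}{(t + yS)(t + yS')^{\beta}} \leq \frac{y^{-2\beta}}{S\, S'^{\beta}} \]

and from (4.5), for all $\eta > 0$,

\[ \mathbb{E}\, S^{-\eta} = \Gamma(\eta)^{-1} \int_0^\infty x^{\eta-1} e^{-\Gamma(1-\beta) x^{\beta}}\, dx < \infty. \tag{4.10} \]

Similarly, for the second term of (4.9), we write

\[ \frac{(t y^{-1} + S)^{\beta} \bigl(S(t + yS') + S'(t + yS)\bigr)}{\bigl(|z|^2 + (t + yS)(t + yS')\bigr)^{\beta+1}} \leq y^{-\beta-1}\, \frac{yS(t + yS') + yS'(t + yS)}{(t + yS)(t + yS')^{\beta+1}} \leq y^{-\beta-1} \left( \frac{1}{(t + yS')^{\beta}} + \frac{yS'}{(t + yS')^{\beta+1}} \right) \leq y^{-2\beta-1} S'^{-\beta} + y^{-2\beta-1} S'^{-\beta}. \]

The expression (4.9) is finite and strictly negative for all $y > 0$. The statement follows from the
implicit function theorem. □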



From (gj]), for aU t > 0, 

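\[ b(z, it) \overset{d}{=} -\frac{z}{|z|^2 + \bigl(t + y_*(|z|^2, t) S\bigr)\bigl(t + y_*(|z|^2, t) S'\bigr)}. \]

By lemma 4.4 we may also define

\[ b(z, 0) := \lim_{t \downarrow 0} b(z, it) \overset{d}{=} -\frac{z}{|z|^2 + y_*^2(|z|^2, 0)\, S S'}. \]

For ease of notation, we set $y_*(r) = y_*(r, 0)$. Since $\partial z = 1$, $\partial |z|^2 = \bar z$, we deduce that

\[ \begin{aligned} -\mathbb{E}\, \partial b(z, 0) &= \mathbb{E}\, \partial \frac{z}{|z|^2 + y_*^2(|z|^2) S S'} \\ &= \mathbb{E} \bigl(|z|^2 + y_*^2(|z|^2) S S'\bigr)^{-1} - |z|^2\, \mathbb{E} \bigl(|z|^2 + y_*^2(|z|^2) S S'\bigr)^{-2} - 2 |z|^2 y_*(|z|^2) y_*'(|z|^2)\, \mathbb{E}\, S S' \bigl(|z|^2 + y_*^2(|z|^2) S S'\bigr)^{-2} \\ &= \bigl( y_*^2(|z|^2) - 2 |z|^2 y_*(|z|^2) y_*'(|z|^2) \bigr)\, \mathbb{E}\, \frac{S S'}{\bigl(|z|^2 + y_*^2(|z|^2) S S'\bigr)^{2}}. \end{aligned} \tag{4.11} \]

The latter is justified since

\[ S S' \bigl(|z|^2 + y_*^2 S S'\bigr)^{-2} \leq y_*^{-4} (S S')^{-1} \]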
is integrable from (|4.10p . The next lemma is an important consequence of Theorems 12.131 and 11.21 
Lemma 4.5. The following identity holds in $\mathcal{D}'(\mathbb{C})$:

\[ \mu_\alpha = -\frac{1}{\pi}\, \partial\, \mathbb{E}\, b(\cdot, 0). \]

Therefore the measure $\mu_\alpha$ is isotropic and has a continuous density given by $1/\pi$ times the right
hand side of (4.11).

Proof. Let i?„ be the resolvent matrix of the bipartized matrix of An defined by (11.41) . By 
theorem 12. 131 and lemma [221 for alH > and z G C, 



lim ERn{U{z, it)) II = 



im{z,t) miz,it) 

Eb{z,it) iEhiz,t) 



From theorem 12. 141 Eva^-z converge weakly to Va.z and, by lemma IXTl for all t > 0, 

lim - [ Inix"^ +t'^)EuA„-zdx) ^ - [ \n{x'^ + t'^)i^a,z{dx). 

From Equation (I3.20[) . J \n{x)i'a,z{dx) is integrable. We deduce that for all zq € C, there exists 
an open neighborhood of zq and a sequence (in)n^i converging to such that for all z in the 
neighborhood, 

E«.,t,(..«„))„ . (fM^-;) f,5,<;'°',) , (4.2, 

and 

lim i / Inix"^ + tl)Ei'A -z{dx) = / \a{x)va z{dx). (4.13) 

n^oo 2 ,/ I 



Moreover from theorem 1.2, Equation (3.20) and lemma A.2, in $\mathcal{D}'(\mathbb{C})$:

\[ \Delta \int \ln(x)\, \nu_{\alpha,z}(dx) = 2\pi \mu_\alpha. \tag{4.14} \]



On the other hand, \ J ln(a;^ + t^)vA^~z{dx) = ^ In | det(i3(z) - it/2n))|, and from (pTS 



A^y" ln(j;2 +i^)Ei/A„_2, (da;) = -2aE&i(z, it). 
The conclusion follows from dHH]), ((11^ and □ 

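It is possible to compute explicitly the expression (4.11) at $z = 0$.

Lemma 4.6. The density of $\mu_\alpha$ at $z = 0$ is

\[ \frac{1}{\pi}\, \frac{\Gamma(1 + 1/\beta)^2\, \Gamma(1 + \beta)^{1/\beta}}{\Gamma(1 - \beta)^{1/\beta}}. \]

Proof. By definition, the real $y_*(0)$ solves the equation

\[ 1 = \mathbb{E} \left( \frac{1}{y_*^2(0)\, S'} \right)^{\beta} = y_*^{-2\beta}(0)\, \mathbb{E}\, S^{-\beta} = \frac{y_*^{-2\beta}(0)}{\Gamma(\beta)} \int_0^\infty x^{\beta-1} e^{-\Gamma(1-\beta) x^{\beta}}\, dx. \]

With the change of variable $x \mapsto x^{\beta}$ and the identity $z\Gamma(z) = \Gamma(1 + z)$, we find easily
$\mathbb{E}\, S^{-\beta} = (\Gamma(1-\beta)\Gamma(1+\beta))^{-1}$ and

\[ y_*(0) = \bigl(\Gamma(1-\beta)\Gamma(1+\beta)\bigr)^{-\frac{1}{2\beta}}. \]

We also have

\[ \mathbb{E}\, S^{-1} = \int_0^\infty e^{-\Gamma(1-\beta) x^{\beta}}\, dx = \frac{1}{\beta\, \Gamma(1-\beta)^{1/\beta}} \int_0^\infty x^{1/\beta - 1} e^{-x}\, dx = \frac{\Gamma(1 + 1/\beta)}{\Gamma(1-\beta)^{1/\beta}}, \]

where we have used again the identity $z\Gamma(z) = \Gamma(1 + z)$. Then the right hand side of (4.11) at
$z = 0$ is equal to

\[ y_*^{2}(0)\, y_*^{-4}(0)\, \mathbb{E} (S S')^{-1} = y_*^{-2}(0)\, \bigl(\mathbb{E}\, S^{-1}\bigr)^2. \]

□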
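The closed form of lemma 4.6 is easy to evaluate. The sketch below computes the density at the origin both from the final formula and from the two intermediate quantities of the proof, $y_*(0)$ and $\mathbb{E}S^{-1}$, as a consistency check. The function names are ours, and the code assumes $0 < \alpha < 2$ so that $\beta = \alpha/2 \in (0,1)$.

```python
import math

def y_star_at_zero(alpha: float) -> float:
    """y_*(0) = (Gamma(1-beta) * Gamma(1+beta))^(-1/(2*beta)), beta = alpha/2."""
    beta = alpha / 2.0
    return (math.gamma(1 - beta) * math.gamma(1 + beta)) ** (-1.0 / (2 * beta))

def density_at_origin(alpha: float) -> float:
    """Closed form of lemma 4.6 for the density of mu_alpha at z = 0."""
    beta = alpha / 2.0
    num = math.gamma(1 + 1 / beta) ** 2 * math.gamma(1 + beta) ** (1 / beta)
    return num / (math.pi * math.gamma(1 - beta) ** (1 / beta))

def density_via_proof(alpha: float) -> float:
    """Same quantity, assembled as y_*(0)^(-2) * (E S^(-1))^2 / pi."""
    beta = alpha / 2.0
    expected_inverse_s = math.gamma(1 + 1 / beta) / math.gamma(1 - beta) ** (1 / beta)
    return y_star_at_zero(alpha) ** (-2) * expected_inverse_s ** 2 / math.pi

for alpha in (0.5, 1.0, 1.5):
    print(alpha, density_at_origin(alpha), density_via_proof(alpha))
```

For $\alpha = 1$ the Gamma factors simplify ($\Gamma(3)^2 = 4$, $\Gamma(3/2)^2 = \pi/4$, $\Gamma(1/2)^2 = \pi$), so the density at the origin equals $1/\pi$.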
4.3. Proof of theorem 1.3. In this subsection, we prove the last statement of theorem 1.3 (the
first part of the theorem being contained in lemmas 4.5 and 4.6). We start with a first technical lemma.

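Lemma 4.7. Let $0 < \beta < 1$, $\delta > 0$, and $f$ be a bounded measurable $\mathbb{R}_+ \to \mathbb{R}$ function such that
$f(y) = O(y^{\beta+\delta})$ as $y \downarrow 0$. Let $Y$ be a random variable such that $\mathbb{P}(Y \geq t) = L(t) t^{-\beta}$ for some
slowly varying function $L$. Then as $t$ goes to infinity

\[ \mathbb{E} f\left( \frac{Y}{t} \right) \sim \beta L(t) t^{-\beta} \int_0^\infty f(y)\, y^{-\beta-1}\, dy. \]

Proof. Define $Y_t = Y/t$. We fix $\varepsilon > 0$ and consider the distribution $\mathbb{P}(Y_t \in \cdot \mid Y_t \geq \varepsilon)$. By assumption,
for $s > \varepsilon$,

\[ \mathbb{P}(Y_t \geq s \mid Y_t \geq \varepsilon) \sim (s/\varepsilon)^{-\beta}. \]

In particular, the distribution of $Y_t$ given $\{Y_t \geq \varepsilon\}$ converges weakly as $t$ goes to infinity to the
distribution with density $\beta x^{-\beta-1} \varepsilon^{\beta}\, dx$ on $[\varepsilon, \infty)$. Since $f$ is bounded and $L$ slowly varying, we get

\[ \mathbb{E}\left[ f\left( \frac{Y}{t} \right) \mathbf{1}_{\{Y \geq \varepsilon t\}} \right] = \mathbb{P}(Y_t \geq \varepsilon)\, \mathbb{E}[f(Y_t) \mid Y_t \geq \varepsilon] \sim L(\varepsilon t)\, \varepsilon^{-\beta} t^{-\beta} \int_\varepsilon^\infty f(y)\, \beta y^{-\beta-1} \varepsilon^{\beta}\, dy \sim \beta L(t) t^{-\beta} \int_\varepsilon^\infty f(y)\, y^{-\beta-1}\, dy. \]

Finally, by assumption, for some constant $c > 0$,

\[ \left| \mathbb{E}\left[ f\left( \frac{Y}{t} \right) \mathbf{1}_{\{Y \leq \varepsilon t\}} \right] \right| \leq c\, t^{-\beta-\delta}\, \mathbb{E}\left[ Y^{\beta+\delta} \mathbf{1}_{\{Y \leq \varepsilon t\}} \right]. \]

Thus by lemma C.1, for some new constant $c > 0$ and all $t \geq 1/\varepsilon$,

\[ \left| \mathbb{E}\left[ f\left( \frac{Y}{t} \right) \mathbf{1}_{\{Y \leq \varepsilon t\}} \right] \right| \leq c\, t^{-\beta-\delta} L(\varepsilon t) (\varepsilon t)^{\delta} \sim c\, t^{-\beta} L(t)\, \varepsilon^{\delta}. \]

We may thus conclude by letting $t$ tend to infinity and then $\varepsilon$ to 0. □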



Lemma 4.8. Let S be a random variable with Laplace transform (|4.4p . There exists a constant 
Co > such that as t goes to infinity, 

ESH{s^t} =lnt + co + o(l). 

Proof. Let gp be the density function of S. From Equation (2.4.8) in Zolotarev [SB], gp has a 
convergent power series representation 



9P 



^ OO 

{x) = -Y.{-ir-' 



r(n+l)r(l-/3)' 



■ sin(7rn/3)x 



The Stirling formula r(a;) ^^-i-oo y ~7 (f)^ implies that the convergence radius of the series is 
+00. Recall that r(/3 + 1) = l3T{fi), and the Euler reflection formula, r(l - /3) sin(7r;3)/7r = r(/3). 
Thus, as $x$ goes to infinity,

\[ g_\beta(x) = \beta x^{-\beta-1} + O(x^{-2\beta-1}). \]



□ 



The next lemma is a consequence of the Karamata Tauberian theorem.

Lemma 4.9. As $t$ goes to infinity,

\[ \mathbb{P}(S S' \geq t) \sim \beta\, t^{-\beta} \ln t, \]

and, with $c_1 = \beta^2 \int_0^\infty (x + 1)^{-2} x^{-\beta}\, dx$,

\[ \mathbb{E}\, \frac{S S'}{(t + S S')^2} \sim c_1\, t^{-\beta-1} \ln t. \]

Proof. Let $x > 0$, since $S$ and $S'$ are independent we have

\[ \mathbb{E} \exp(-x S S') = \mathbb{E} \exp\bigl( -\Gamma(1-\beta)\, x^{\beta} S^{\beta} \bigr). \]

From Corollary 8.1.7 in [9] we have as $t$ goes to infinity, $\mathbb{P}(S > t) \sim t^{-\beta}$. In particular, we have
$\mathbb{P}(S^{\beta} > t) \sim t^{-1}$ and a new application of Corollary 8.1.7 in [9] gives as $x \downarrow 0$,

\[ 1 - \mathbb{E} \exp(-x S^{\beta}) \sim x \ln x^{-1}. \]

We obtain

\[ 1 - \mathbb{E} \exp(-x S S') \sim \Gamma(1-\beta)\, x^{\beta} \ln\bigl( \Gamma(1-\beta)^{-1} x^{-\beta} \bigr) \sim \beta\, \Gamma(1-\beta)\, x^{\beta} \ln(x^{-1}). \]

We then conclude by a third application of Corollary 8.1.7 in [9]. The second statement is a
consequence of lemma 4.7. □
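The last step of the proof can be spelled out as follows. This is a sketch of one way to apply lemma 4.7, under our reading that it is used with $Y = SS'$, $f(y) = y/(1+y)^2$ and $L(t) := t^{\beta}\,\mathbb{P}(SS' \geq t) \sim \beta \ln t$; note that $f$ is bounded and $f(y) = O(y) = O(y^{\beta+\delta})$ with $\delta = 1 - \beta > 0$.

```latex
\begin{align*}
\mathbb{E}\,\frac{SS'}{(t+SS')^{2}}
  &= t^{-1}\,\mathbb{E}\, f\!\left(\frac{SS'}{t}\right),
  \qquad f(y) = \frac{y}{(1+y)^{2}}, \\
  &\sim t^{-1}\,\beta\, L(t)\, t^{-\beta} \int_{0}^{\infty} f(y)\, y^{-\beta-1}\, dy,
  \qquad L(t) \sim \beta \ln t, \\
  &\sim \beta^{2}\, t^{-\beta-1} \ln t \int_{0}^{\infty} (1+y)^{-2}\, y^{-\beta}\, dy
  \;=\; c_{1}\, t^{-\beta-1} \ln t .
\end{align*}
```

The integral is finite because $0 < \beta < 1$, which recovers the constant $c_1$ of the statement.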

The next lemma gives the asymptotic behavior of y*{r) as r goes to infinity. 

Lemma 4.10. There exists a constant $c_2 > 0$ such that as $r$ goes to infinity,

\[ y_*(r) \sim c_2\, \sqrt{r}\, e^{-r^{\beta}/2}. \]

Proof. From Equations (14.51) . (|4.8|) . we have with y^, = 



L^Jx^-'Eexp i^-^-xy^^S')dx 



(4.15) 



r(i + /3)r(i - /?)yf 

By lemma [4.41 lim^-i-oo y* ('') = 0. Hence, from the above expression, we deduce that the term 
ry^'^ goes to infinity as r goes to infinity. Define 

^^^^ = r(i + /3)r(l -(3) 1 e'"e'«r(r-',)i/^ dx ^ loiy) + h{y) + h{y), 
with /o(2/) = 

= r(i + mi -^j /-'^^"- 

The function / is increasing and limj,_j.oo ^(y) < oo. Also, the function /q is equal to in a 
neighborhood of 0. By lemma l477l we get as t goes to infinity, 

mo{s/t)^aot-^, 

for some positive constant oq = r(i+ff)r(i-/3) /i / ^"^e !/r(i-/3)i//3 Py-P-^dxdy. By lemma 

E[/i(S'/t)] t^'^lnt + cor'^ +o(l). 
Also, from Laplace method, hiy) ^ -r(2^)r(l - f3)'^y'^^ as y goes to 0. By lemma liJl 

¥.h{S/t)^a2t~^, 

with 02 = r(i+/3)r(i-,3) /o ~ (3y~^~^dxdy. Hence, for t — ry^^, we get from 

mB 

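\[ y_*^{2\beta} = (r y_*^{-2})^{-\beta} \ln(r y_*^{-2}) + (c_0 + a_0 + a_2)(r y_*^{-2})^{-\beta} + o\bigl( (r y_*^{-2})^{-\beta} \bigr). \]

In other words,

\[ r^{\beta} = \ln(r y_*^{-2}) + (c_0 + a_0 + a_2) + o(1). \]

We conclude by setting $c_2 = \exp((c_0 + a_0 + a_2)/2)$. □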
Lemma 4.11. As $r$ goes to infinity,

w/ierec3 = 2 xe'^'e -ni-o)^^" ps-f^-^dxds / {T{1 + P)T{\ ^ P)) . 

Proof. We define 

^ ^ ivrhs)' = w) I 

From the implicit function theorem 

, drG{y^,r) 

y*^''' dyGiy.,r)- 

We have 



y2^*+ir(i + /?)r(i - /3) 

The Laplace method implies that, as $t$ goes to infinity,

xe-^e r(i-«i/'3 rfa; _ r(2/3)r(l - 
Thus by lemma HTTl we deduce that 

xe'^Ee sr(i-«i/^ _ i-/3 / / a;e-=^e =r(i-«i/^^ Ps'^-^dixds - ct"''. 

Applying the above to < = ry^'^{r) we deduce, with C3 = 2c/(r(l + /3)r(l — /?)), 

dyG{y*,r) C3r"'^y7^(r). 

Similarly, the derivative of G with respect to r is 

Once again, the Laplace method implies that, as $t$ goes to infinity,



i//3e-^e r(i^«i//3 rfa; _ r(/3 + l)r(l - 



In particular, for all e > there exists to such that 
il~e)t-^-^ESH{s^t/to} 



1 /■ ^VPf 



s; (i + £)i-^-Wi{s^t/t„}. 

By lemma [ 



It follows that for some ti > to and all t ^ ti, 

(l-2£)t-^-ilnt ^ r(l-/?)VWr(l + /3) y ^'/^e"'IEe"-(-«^^''5-il{s^t/to}rf^ < (l+2£)t-^-i Int. 
On the other hand, for some constant $c > 0$ and all $t \geq 1$,

x^/'^e-^'Ee sr(i-«i//3 S-H^s;,t/t„}dx < / a;i/'3e-=^dxP(5 ^ t/to) ^ ct-'^-H^+\ 



We thus have proved that 



1 /■ ^l//3t 

x^/'^e-^Ee sr(i-^)i//3 5-1^2; - t^/^'i Int, 



r(l - /3)i//3+ir(H- ;3) 
and

The statement follows. □ 

Proof of theorem \l.Si From Equation (|4.1ip and lemma the density at r — |zp is equivalent 
to I/tt times 



It remains to apply Iemmas l4.10l and l4.lll and set the multiplicative constant to be c = 27r ^c^ ^cic^^ . 

□ 

Appendix A. Logarithmic potentials and Hermitization 

Let $\mathcal{P}(\mathbb{C})$ be the set of probability measures on $\mathbb{C}$ which integrate $\ln|\cdot|$ in a neighborhood of
infinity. For every $\mu \in \mathcal{P}(\mathbb{C})$, the logarithmic potential $U_\mu$ of $\mu$ on $\mathbb{C}$ is the function $U_\mu : \mathbb{C} \to
[-\infty, +\infty)$ defined for every $z \in \mathbb{C}$ by

\[ U_\mu(z) = \int_{\mathbb{C}} \ln|z - z'|\, \mu(dz') = (\ln|\cdot| * \mu)(z). \tag{A.1} \]

Note that in classical potential theory, the definition is opposite in sign, but ours turns out to be
more convenient (lightweight) for our purposes. Since $\ln|\cdot|$ is Lebesgue locally integrable on $\mathbb{C}$, one
can check by using the Fubini theorem that $U_\mu$ is Lebesgue locally integrable on $\mathbb{C}$. In particular,
$|U_\mu| < \infty$ a.e. (Lebesgue almost everywhere) and $U_\mu \in \mathcal{D}'(\mathbb{C})$. Since $\ln|\cdot|$ is the fundamental
solution of the Laplace equation in $\mathbb{C}$, we have, in $\mathcal{D}'(\mathbb{C})$,

\[ \Delta U_\mu = 2\pi \mu. \tag{A.2} \]

Lemma A.1 (Unicity). For every $\mu, \nu \in \mathcal{P}(\mathbb{C})$, if $U_\mu = U_\nu$ a.e. then $\mu = \nu$.

Proof. Since $U_\mu = U_\nu$ in $\mathcal{D}'(\mathbb{C})$, we get $\Delta U_\mu = \Delta U_\nu$ in $\mathcal{D}'(\mathbb{C})$. Now (A.2) gives $\mu = \nu$ in $\mathcal{D}'(\mathbb{C})$,
and thus $\mu = \nu$ as measures since $\mu$ and $\nu$ are Radon measures. □

If $A$ is an $n \times n$ complex matrix and $P_A(z) := \det(A - zI)$ is its characteristic polynomial,

\[ U_{\mu_A}(z) = \int_{\mathbb{C}} \ln|z' - z|\, \mu_A(dz') = \frac{1}{n} \ln|\det(A - zI)| = \frac{1}{n} \ln|P_A(z)| \]

for every $z \in \mathbb{C} \setminus \{\lambda_1(A), \ldots, \lambda_n(A)\}$. We have also the alternative expression

\[ U_{\mu_A}(z) = \frac{1}{n} \ln \det\bigl( \sqrt{(A - zI)(A - zI)^*} \bigr) = \int_0^\infty \ln(t)\, \nu_{A - zI}(dt). \tag{A.3} \]
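The identity between the eigenvalue and singular value expressions of the logarithmic potential can be checked by hand on a small matrix. The sketch below does so for a $2 \times 2$ complex matrix using only closed-form formulas (quadratic formula for the eigenvalues, trace and determinant of $MM^*$ for the singular values). The helper names and the sample matrix are illustrative assumptions.

```python
import cmath
import math

def eigenvalues_2x2(a, b, c, d):
    """Eigenvalues of [[a, b], [c, d]] via the quadratic formula."""
    tr, det = a + d, a * d - b * c
    disc = cmath.sqrt(tr * tr - 4 * det)
    return (tr + disc) / 2, (tr - disc) / 2

def singular_values_2x2(a, b, c, d):
    """Singular values of M = [[a, b], [c, d]] from trace and det of M M*."""
    frob2 = abs(a) ** 2 + abs(b) ** 2 + abs(c) ** 2 + abs(d) ** 2  # trace of M M*
    det2 = abs(a * d - b * c) ** 2                                  # det of M M*
    disc = math.sqrt(max(frob2 ** 2 - 4 * det2, 0.0))
    return math.sqrt((frob2 + disc) / 2), math.sqrt((frob2 - disc) / 2)

def log_potential_three_ways(A, z):
    """Return U_{mu_A}(z) computed from eigenvalues, determinant, singular values."""
    (a, b), (c, d) = A
    n = 2
    via_eig = sum(math.log(abs(lam - z)) for lam in eigenvalues_2x2(a, b, c, d)) / n
    via_det = math.log(abs((a - z) * (d - z) - b * c)) / n
    via_sv = sum(math.log(s) for s in singular_values_2x2(a - z, b, c, d - z)) / n
    return via_eig, via_det, via_sv

A = ((1 + 2j, 3.0), (0.5j, -1.0))
print(log_potential_three_ways(A, 0.3 - 0.7j))
```

The three numbers agree up to rounding, which is the content of the two displayed expressions for $U_{\mu_A}(z)$ when $z$ is not an eigenvalue.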

The identity above bridges the eigenvalues with the singular values, and is at the heart of the
following lemma, which makes it possible to deduce the convergence of $\mu_{A_n}$ from the one of $\nu_{A_n - zI}$. The
strength of this Hermitization lies in the fact that contrary to the eigenvalues, one can control
the singular values with the entries of the matrix. The price paid here is the introduction of
the auxiliary variable $z$ and the uniform integrability. We recall that on a Borel measurable space
$(E, \mathcal{E})$, we say that a Borel function $f : E \to \mathbb{R}$ is uniformly integrable for a sequence of probability
measures $(\eta_n)_{n \geq 1}$ on $E$ when

\[ \lim_{t \to \infty} \varlimsup_{n \to \infty} \int_{\{|f| > t\}} |f|\, d\eta_n = 0. \]

We will use this property as follows: if $\eta_n \rightsquigarrow \eta$ and $f$ is continuous and uniformly integrable
for $(\eta_n)_{n \geq 1}$ then $f$ is $\eta$-integrable and $\lim_{n \to \infty} \int f\, d\eta_n = \int f\, d\eta$. Similarly, for a sequence of random
probability measures $(\eta_n)_{n \geq 1}$ we will say that $f$ is uniformly integrable for $(\eta_n)_{n \geq 1}$ in probability,
if for all $\varepsilon > 0$,

\[ \lim_{t \to \infty} \varlimsup_{n \to \infty} \mathbb{P}\left( \int_{\{|f| > t\}} |f|\, d\eta_n > \varepsilon \right) = 0. \]

A proof of lemma A.2 below can be found in [12] which covers the "a.s." case, the "in probability"
case being similar. It relies only on the unicity lemma A.1, the classical Prohorov theorem, and
the Weyl inequalities of Lemma B.5 linking eigenvalues and singular values.

Lemma A.2 (Girko's Hermitization method). Let $(A_n)_{n \geq 1}$ be a sequence of complex random
matrices where $A_n$ is $n \times n$ for every $n \geq 1$. Suppose that for Lebesgue almost all $z \in \mathbb{C}$, there
exists a probability measure $\nu_z$ on $[0, \infty)$ such that

(i) a.s. $(\nu_{A_n - zI})_{n \geq 1}$ tends weakly to $\nu_z$
(ii) a.s. (resp. in probability) $\ln(\cdot)$ is uniformly integrable for $(\nu_{A_n - zI})_{n \geq 1}$

Then there exists a probability measure $\mu \in \mathcal{P}(\mathbb{C})$ such that

(j) a.s. (resp. in probability) $(\mu_{A_n})_{n \geq 1}$ converges weakly to $\mu$
(jj) for a.a. $z \in \mathbb{C}$,

\[ U_\mu(z) = \int_0^\infty \ln(t)\, \nu_z(dt). \]

Appendix B. General spectral estimates 
Lemma B.1 (Basic inequalities [32]). If $A$ and $B$ are $n \times n$ complex matrices then

\[ s_1(AB) \leq s_1(A) s_1(B) \quad \text{and} \quad s_1(A + B) \leq s_1(A) + s_1(B) \tag{B.1} \]

and

\[ \max_{1 \leq i \leq n} |s_i(A) - s_i(B)| \leq s_1(A - B). \tag{B.2} \]

Lemma B.2 (Rudelson-Vershynin row bound [451 d^j). Let A be a complex n x n matrix with 
rows $R_1, \ldots, R_n$. Define the vector space $R_{-i} := \mathrm{span}\{R_j ; j \neq i\}$. We have then

\[ n^{-1/2} \min_{1 \leq i \leq n} \mathrm{dist}(R_i, R_{-i}) \leq s_n(A) \leq \min_{1 \leq i \leq n} \mathrm{dist}(R_i, R_{-i}). \]

Recall that the singular values $s_1(A), \ldots, s_{n'}(A)$ of a rectangular $n' \times n$ complex matrix $A$ with
$n' \leq n$ are defined by $s_i(A) := \lambda_i(\sqrt{AA^*})$ for every $1 \leq i \leq n'$.

Lemma B.3 (Tao-Vu negative second moment [50, Lemma A4]). If $A$ is a full rank $n' \times n$ complex
matrix ($n' \leq n$) with rows $R_1, \ldots, R_{n'}$, and $R_{-i} := \mathrm{span}\{R_j ; j \neq i\}$, then

\[ \sum_{i=1}^{n'} s_i(A)^{-2} = \sum_{i=1}^{n'} \mathrm{dist}(R_i, R_{-i})^{-2}. \]

Lemma B.4 (Cauchy interlacing by rows deletion [32]). Let $A$ be an $n \times n$ complex matrix. If $B$
is $n' \times n$, obtained from $A$ by deleting $n - n'$ rows, then for every $1 \leq i \leq n'$,

\[ s_i(A) \geq s_i(B) \geq s_{i+n-n'}(A). \]

Lemma B.5 (Weyl inequalities [53]). For every $n \times n$ complex matrix $A$, we have

\[ \prod_{i=1}^{k} |\lambda_i(A)| \leq \prod_{i=1}^{k} s_i(A) \quad \text{and} \quad \prod_{i=k}^{n} s_i(A) \leq \prod_{i=k}^{n} |\lambda_i(A)| \tag{B.3} \]

for all $1 \leq k \leq n$. In particular, by viewing $|\det(A)|$ as a volume,

\[ |\det(A)| = \prod_{k=1}^{n} |\lambda_k(A)| = \prod_{k=1}^{n} s_k(A) = \prod_{k=1}^{n} \mathrm{dist}(R_k, \mathrm{span}\{R_1, \ldots, R_{k-1}\}) \tag{B.4} \]

where $R_1, \ldots, R_n$ are the rows of $A$. Moreover, for every increasing function $\varphi$ from $(0, \infty)$ to
$(0, \infty)$ such that $t \mapsto \varphi(e^t)$ is convex on $(0, \infty)$ and $\varphi(0) := \lim_{t \to 0^+} \varphi(t) = 0$, we have

\[ \sum_{i=1}^{k} \varphi(|\lambda_i(A)|^2) \leq \sum_{i=1}^{k} \varphi(s_i(A)^2) \tag{B.5} \]

for every $1 \leq k \leq n$. In particular, with $\varphi(t) = t^{r/2}$, $r > 0$, and $k = n$, we obtain

\[ \sum_{k=1}^{n} |\lambda_k(A)|^r \leq \sum_{k=1}^{n} s_k(A)^r. \tag{B.6} \]
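A non-normal $2 \times 2$ matrix already shows how far apart eigenvalues and singular values can be while still obeying the Weyl inequalities. The sketch below checks (B.3), (B.4) and (B.6) on the matrix with rows $(1, 10)$ and $(0, 1)$; the helper names and the choice of matrix are ours.

```python
import cmath
import math

def eigenvalue_moduli_2x2(a, b, c, d):
    """Moduli of the eigenvalues of [[a, b], [c, d]], largest first."""
    tr, det = a + d, a * d - b * c
    disc = cmath.sqrt(tr * tr - 4 * det)
    return sorted((abs((tr + disc) / 2), abs((tr - disc) / 2)), reverse=True)

def singular_values_2x2(a, b, c, d):
    """Singular values of [[a, b], [c, d]], largest first."""
    frob2 = abs(a) ** 2 + abs(b) ** 2 + abs(c) ** 2 + abs(d) ** 2  # trace of A A*
    det2 = abs(a * d - b * c) ** 2                                  # det of A A*
    disc = math.sqrt(max(frob2 ** 2 - 4 * det2, 0.0))
    return [math.sqrt((frob2 + disc) / 2), math.sqrt((frob2 - disc) / 2)]

# Both eigenvalues equal 1, yet the largest singular value is close to 10.
lam = eigenvalue_moduli_2x2(1.0, 10.0, 0.0, 1.0)
s = singular_values_2x2(1.0, 10.0, 0.0, 1.0)

print("|lambda|:", lam, " s:", s)
print("(B.3) with k = 1:", lam[0] <= s[0], "  with k = n:", s[1] <= lam[1])
print("(B.4):", lam[0] * lam[1], "vs", s[0] * s[1])
for r in (0.5, 1.0, 2.0):
    print("(B.6) with r =", r, ":", sum(x ** r for x in lam), "<=", sum(x ** r for x in s))
```

The products coincide, as (B.4) requires, while each power sum of singular values dominates the corresponding sum for the eigenvalues.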
Lemma B.6 (Schatten bound [55, proof of Theorem 3.32]). Let $A$ be an $n \times n$ complex matrix
with rows $R_1, \ldots, R_n$. Then for every $0 < r \leq 2$,

\[ \sum_{k=1}^{n} s_k(A)^r \leq \sum_{k=1}^{n} \|R_k\|_2^r. \tag{B.7} \]

Appendix C. Additional lemmas

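We begin with a lemma on truncated moments. We skip the proof since it follows from an
adaptation of the proof in the real case given by e.g. Feller [23, Theorem VIII.9.2].

Lemma C.1 (Truncated moments). If (H1) holds then for every $p > \alpha$,

\[ \mathbb{E}\bigl[ |X_{11}|^p\, \mathbf{1}_{\{|X_{11}| \leq t\}} \bigr] \sim c(p)\, L(t)\, t^{p-\alpha} \]

where $c(p) := \alpha/(p - \alpha)$. In particular, we have

\[ \mathbb{E}\bigl[ |X_{11}|^p\, \mathbf{1}_{\{|X_{11}| \leq a_n\}} \bigr] \sim c(p)\, \frac{a_n^p}{n}. \]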

We end this section with a result on the concentration of the spectral measure of Hermitian or
Hermitized random matrices, mentioned in [13]. The total variation norm of $f : \mathbb{R} \to \mathbb{R}$ is

\[ \|f\|_{TV} := \sup \sum_{k \in \mathbb{Z}} |f(x_{k+1}) - f(x_k)|, \]

where the supremum runs over all sequences $(x_k)_{k \in \mathbb{Z}}$ such that $x_{k+1} \geq x_k$ for any $k \in \mathbb{Z}$. If
$f = \mathbf{1}_{(-\infty, s]}$ for some real $s$ then $\|f\|_{TV} = 1$, while if $f$ has a derivative in $L^1(\mathbb{R})$, we get

\[ \|f\|_{TV} = \int |f'(t)|\, dt. \]
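As an illustration of this norm, the sketch below evaluates the variation of the truncated logarithm $f_n(x) = \mathbf{1}_{[n^{-b}, n^{b}]}(x)\log(x)$ (extended by 0 elsewhere) along a grid containing both jump points. Since $f_n$ is monotone between its two jumps, the grid sum equals the full variation $4b\log n$, so the constant in a bound of the form $c \log n$ can be taken as $c = 4b$ under this reading. The function names and the values of `n` and `b` are illustrative assumptions.

```python
import math

def total_variation_on_grid(f, grid):
    """Sum of |f(x_{k+1}) - f(x_k)| along an increasing grid.
    In general this is only a lower bound for the total variation norm."""
    values = [f(x) for x in grid]
    return sum(abs(v1 - v0) for v0, v1 in zip(values, values[1:]))

def make_truncated_log(n, b):
    """f_n(x) = log(x) on [n^-b, n^b], and 0 outside that interval."""
    lo, hi = n ** (-b), n ** b
    return lambda x: math.log(x) if lo <= x <= hi else 0.0

n, b = 100, 2.0
lo, hi = n ** (-b), n ** b

# Geometric grid hitting both endpoints exactly, with one point on each side.
m = 1000
inside = [lo * (hi / lo) ** (k / m) for k in range(m + 1)]
inside[0], inside[-1] = lo, hi
grid = [lo / 2] + inside + [2 * hi]

tv = total_variation_on_grid(make_truncated_log(n, b), grid)
print(tv, 4 * b * math.log(n))
```

The four contributions are the jump of size $b \log n$ at each endpoint and the increase of $2b \log n$ in between.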



The following lemma comes with remarkably weak assumptions, and makes it possible to deduce the
almost sure weak convergence of empirical spectral measures of random matrices without any
moment assumptions on the entries. We discovered that this lemma was obtained independently by
Guntuboyina and Leeb in [30], where they discuss the relationships with more classical results.

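Lemma C.2 (Concentration for spectral measures). Let $H$ be an $n \times n$ random Hermitian matrix.
Let us assume that the vectors $(H_i)_{1 \leq i \leq n}$, where $H_i := (H_{ij})_{1 \leq j \leq i} \in \mathbb{C}^{i-1} \times \mathbb{R}$, are independent. Then
for any $f : \mathbb{R} \to \mathbb{R}$ going to 0 at $\pm\infty$ and such that $\|f\|_{TV} \leq 1$ and every $t \geq 0$,

\[ \mathbb{P}\left( \left| \int f\, d\mu_H - \mathbb{E} \int f\, d\mu_H \right| \geq t \right) \leq 2 \exp\left( -\frac{n t^2}{2} \right). \]

Similarly, if $M$ is an $n \times n$ complex random matrix with independent rows (or with independent
columns) then for any $f : \mathbb{R} \to \mathbb{R}$ going to 0 at $\pm\infty$ with $\|f\|_{TV} \leq 1$ and every $t \geq 0$,

\[ \mathbb{P}\left( \left| \int f\, d\nu_M - \mathbb{E} \int f\, d\nu_M \right| \geq t \right) \leq 2 \exp\left( -2 n t^2 \right). \]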



Proof. We prove only the Hermitian version, the non-Hermitian version being entirely similar.
Let us start by showing that for every $n \times n$ deterministic Hermitian matrices $A$ and $B$ and any
measurable function $f$ with $\|f\|_{TV} = 1$,

\[ \left| \int f\, d\mu_A - \int f\, d\mu_B \right| \leq \frac{\mathrm{rank}(A - B)}{n}. \tag{C.1} \]

Indeed, it is well known (follows from interlacing, see e.g. [5T] or [51 Theorem 11.42]) that 

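\[ \|F_A - F_B\|_\infty \leq \frac{\mathrm{rank}(A - B)}{n} \]

where $F_A$ and $F_B$ are the cumulative distribution functions of $\mu_A$ and $\mu_B$ respectively. Now if $f$
is smooth, we get, by integrating by parts,

\[ \left| \int f\, d\mu_A - \int f\, d\mu_B \right| = \left| \int f'(t) F_A(t)\, dt - \int f'(t) F_B(t)\, dt \right| \leq \frac{\mathrm{rank}(A - B)}{n} \int |f'(t)|\, dt, \]

and since the left hand side depends on at most $2n$ points, we get (C.1) by approximating $f$ by
smooth functions. Next, for any $x = (x_1, \ldots, x_n) \in \mathcal{X} := \{(x_i)_{1 \leq i \leq n} : x_i \in \mathbb{C}^{i-1} \times \mathbb{R}\}$, let $H(x)$ be
the $n \times n$ Hermitian matrix given by $H(x)_{ij} := x_{i,j}$ for $1 \leq j \leq i \leq n$. We have $\mu_H = \mu_{H(H_1, \ldots, H_n)}$.
For all $x \in \mathcal{X}$ and $x'_i \in \mathbb{C}^{i-1} \times \mathbb{R}$, the matrix

\[ H(x_1, \ldots, x_{i-1}, x_i, x_{i+1}, \ldots, x_n) - H(x_1, \ldots, x_{i-1}, x'_i, x_{i+1}, \ldots, x_n) \]

has only the $i$-th row and column possibly different from 0, and thus

\[ \mathrm{rank}\bigl( H(x_1, \ldots, x_{i-1}, x_i, x_{i+1}, \ldots, x_n) - H(x_1, \ldots, x_{i-1}, x'_i, x_{i+1}, \ldots, x_n) \bigr) \leq 2. \]

Therefore from (C.1), we obtain, for every $f : \mathbb{R} \to \mathbb{R}$ with $\|f\|_{TV} \leq 1$,

\[ \left| \int f\, d\mu_{H(x_1, \ldots, x_{i-1}, x_i, x_{i+1}, \ldots, x_n)} - \int f\, d\mu_{H(x_1, \ldots, x_{i-1}, x'_i, x_{i+1}, \ldots, x_n)} \right| \leq \frac{2}{n}. \]

The desired result follows now from the Azuma-Hoeffding inequality, see e.g. [38, Lemma 1.2]. □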